Abstract

We present a theory of hypoellipticity and unique ergodicity for semilinear parabolic 
stochastic PDEs with "polynomial" nonlinearities and additive noise, considered as 
abstract evolution equations in some Hilbert space. It is shown that if Hörmander's
bracket condition holds at every point of this Hilbert space, then a lower bound on
the Malliavin covariance operator $\mathcal{M}_t$ can be obtained. Informally, this bound can be
read as "Fix any finite-dimensional projection $\Pi$ on a subspace of sufficiently regular
functions. Then the eigenfunctions of $\mathcal{M}_t$ with small eigenvalues have only a very
small component in the image of $\Pi$."

We also show how to use a priori bounds on the solutions to the equation to obtain 
good control on the dependency of the bounds on the Malliavin matrix on the initial 
condition. These bounds are sufficient in many cases to obtain the asymptotic strong 
Feller property introduced in [HM06].

One of the main novel technical tools is an almost sure bound from below on the size 
of "Wiener polynomials," where the coefficients are possibly non-adapted stochastic 
processes satisfying a Lipschitz condition. By exploiting the polynomial structure of 
the equations, this result can be used to replace Norris' lemma, which is unavailable in 
the present context. 

We conclude by showing that the two-dimensional stochastic Navier-Stokes equa- 
tions and a large class of reaction-diffusion equations fit the framework of our theory. 
1 Introduction

The overarching goal of this article is to prove the unique ergodicity of a class of
nonlinear stochastic partial differential equations (SPDEs) driven by a finite number of
Wiener processes. In this section, we give an overview of the setting and the results to
come later without descending into all of the technical assumptions required to make
everything precise. This imprecision will be rectified starting with Section 3, where the
setting and basic assumptions will be detailed.

In this article we will investigate non-linear equations of the form 

$$\partial_t u(x,t) + L u(x,t) = N(u)(x,t) + \sum_{k=1}^{d} g_k(x)\,\dot W_k(t) . \qquad (1.1)$$

Here $L$ will be some positive selfadjoint operator with compact resolvent. Typical
examples arising in applications are $L = -\Delta$ or $L = \Delta^2$. $N$ will be assumed to be
a "polynomial" nonlinearity in the sense that $N(u) = \sum_{k=1}^{m} N_k(u)$, where $N_k$ is
$k$-multilinear. Examples of admissible nonlinearities are the Navier-Stokes nonlinearity
$(u \cdot \nabla)u$ or a reaction term such as $u - u^3$. The $g_k$ are a collection of smooth,
time-independent functions which dictate the "directions" in which the randomness is injected.
The $\{\dot W_k : k = 1, \ldots, d\}$ are a collection of mutually independent one-dimensional
white noises which are understood as the formal derivatives of independent Wiener
processes through the Itô calculus. We assume that the possible loss of regularity due to
the nonlinearity is controlled by the smoothing properties of the semigroup generated
by $L$. See Assumption A.1 below for a precise meaning.

On one hand, our concentration on a finite number of driving Wiener processes
avoids technical difficulties generated by spatially rough solutions since $W(x,t) =
\sum_{k=1}^{d} g_k(x) W_k(t)$ has the same regularity in $x$ as the $g_k$, which we take to be relatively
smooth. On the other hand, the fact that $W$ contains only a finite number of Wiener
processes means that our dynamic is very far from being uniformly elliptic in any sense
since for fixed $t$, $u(\cdot, t)$ is an infinite-dimensional random variable and the noise acts
only on a finite number of degrees of freedom. To prove an ergodic theorem, we must
understand how the randomness injected by $W$ in the directions $\{g_k : k = 1, \ldots, d\}$
spreads through the infinite dimensional phase space. To do this, we prove the
non-degeneracy of the Malliavin covariance matrix under an assumption that the linear span
of the successive Lie brackets^1 of vector fields associated to $N$ and the $g_k$ is dense in
the ambient (Hilbert) space at each point. This is very reminiscent of the condition
in the "weak" version of Hörmander's "sum of squares" theorem. It ensures that the
randomness spreads to a dense set of directions despite being injected in only a finite
number of directions. This is possible since, although the randomness is injected in a
finite number of directions, it is injected over the entire interval of time from zero to the
current time.
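
To make the spreading mechanism concrete, the following minimal sketch looks at a two-dimensional linear analogue, $dx = Ax\,dt + g\,dW$, with noise injected in a single direction. The toy model, the matrices and the function names are illustrative assumptions and are not part of the setting of this article. For the linear drift $F(x) = Ax$ and a constant vector field $g$, the bracket $[F, g]$ is proportional to $Ag$, and the finite-dimensional Malliavin matrix is the Gramian $\int_0^t e^{As} g g^T e^{A^T s}\,ds$, which is non-degenerate exactly when $g$ and $Ag$ span the plane.

```python
def malliavin_gramian(A, g, t=1.0, steps=2000):
    """Approximate M_t = int_0^t e^{As} g g^T e^{A^T s} ds for a 2x2 matrix A.

    y(s) = e^{As} g is advanced with the classical RK4 scheme and the
    integral is accumulated with the trapezoidal rule.
    """
    def f(y):
        return [A[0][0] * y[0] + A[0][1] * y[1],
                A[1][0] * y[0] + A[1][1] * y[1]]

    def rk4_step(y, h):
        k1 = f(y)
        k2 = f([y[i] + 0.5 * h * k1[i] for i in range(2)])
        k3 = f([y[i] + 0.5 * h * k2[i] for i in range(2)])
        k4 = f([y[i] + h * k3[i] for i in range(2)])
        return [y[i] + h * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6.0
                for i in range(2)]

    h = t / steps
    y = list(g)
    M = [[0.0, 0.0], [0.0, 0.0]]
    for _ in range(steps):
        y_next = rk4_step(y, h)
        for i in range(2):
            for j in range(2):
                M[i][j] += 0.5 * h * (y[i] * y[j] + y_next[i] * y_next[j])
        y = y_next
    return M


def det2(M):
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]


g = [1.0, 0.0]                          # noise only in the first coordinate
A_coupled = [[-1.0, 0.0], [1.0, -1.0]]  # drift feeds coordinate 1 into coordinate 2
A_decoupled = [[-1.0, 0.0], [0.0, -1.0]]

print("coupled   det M_1 =", det2(malliavin_gramian(A_coupled, g)))
print("decoupled det M_1 =", det2(malliavin_gramian(A_decoupled, g)))
```

In the coupled case the determinant is strictly positive although only one direction is forced, while in the decoupled case the second coordinate never sees the noise and the Gramian is singular.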

In finite dimensions, bounds on the norm of the inverse of the Malliavin matrix are 
the critical ingredient in proving ergodic theorems for diffusions which are only hy- 
poelliptic rather than uniformly elliptic. This then shows that the system has a smooth 
density with respect to Lebesgue measure. In infinite dimensions, there is no mea- 
sure which plays the "universal" role of Lebesgue measure. One must therefore pass 
through a different set of ideas. Furthermore, it is not so obvious how to generalise the 
notion of the 'inverse' of the Malliavin matrix. In finite dimensions, a linear map has
dense range if and only if it admits a bounded inverse. In infinite dimensions, these two 
notions are very far from equivalent and, while it is possible in some cases to show that 
the Malliavin matrix has dense range, it is hardly ever possible in a hypoelliptic setting 
to show that it is invertible, or at least to characterise its range in a satisfactory manner 
(see [MSVE07] for a linear example in which it is possible).

The important fact which must be established is that nearby points act similarly
from a "measure theoretic perspective." One classical way to make this precise is to
prove that the Markov process in question has the strong Feller property. For a
continuous time Markov process this is equivalent to proving that the transition probabilities
are continuous in the total variation norm. While this concept is useful in finite
dimensions, it is much less useful in infinite dimensions. In particular, there are many natural
infinite dimensional Markov processes whose transition probabilities do not converge
in total variation to the system's unique invariant measure. (See Examples 3.14 and
3.15 from [HM06] for more discussion of this point.) In these settings, this fact also
precludes the use of "minorization" conditions such as $\inf_{x \in C} \mathcal{P}_t(x, \cdot) \ge c\,\nu(\cdot)$ for
some fixed probability measure $\nu$ and "small set" $C$. (See [MT93, GM06] for more and
examples where this can be used.)

^1 Recall that, when it is defined, the Lie bracket $[G, H](u) = (DH)(u)G(u) - (DG)(u)H(u)$ for two
functions $G$, $H$ from the ambient Hilbert space $\mathcal{H}$ to itself. Here $D$ is the Fréchet derivative.

1.1 Ergodicity in infinite dimensions and main result 

In [HM06], the authors introduced the concept of an asymptotic strong Feller diffusion.
Loosely speaking, it ensures that transition probabilities are uniformly continuous in a
sequence of 1-Wasserstein distances which converge to the total variation distance as
time progresses. For the precise definitions, we refer the reader to [HM06]. For our
present purpose, it is sufficient to recall the following proposition:



Proposition 1.1 (Proposition 3.12 from [HM06]) Let $t_n$ and $\delta_n$ be two positive
sequences with $\{t_n\}$ non-decreasing and $\{\delta_n\}$ converging to zero. A semigroup $\mathcal{P}_t$ on a
Hilbert space $\mathcal{H}$ is asymptotically strong Feller if, for all $\varphi : \mathcal{H} \to \mathbb{R}$ with $\|\varphi\|_\infty$ and
$\|D\varphi\|_\infty$ finite one has

$$\|D\mathcal{P}_{t_n}\varphi(u)\| \le C(\|u\|)\big(\|\varphi\|_\infty + \delta_n \|D\varphi\|_\infty\big) \qquad (1.2)$$

for all $n$ and $u \in \mathcal{H}$, where $C : \mathbb{R}_+ \to \mathbb{R}$ is a fixed non-decreasing function.

The importance of the asymptotic strong Feller property is given by the following
result which states that in this case, any two distinct ergodic invariant measures must
have disjoint topological supports. Recalling that $u$ belongs to the support of a measure
$\mu$ (denoted $\mathrm{supp}(\mu)$) if $\mu(B_\delta(u)) > 0$ for every $\delta > 0$,^2 we have:

Theorem 1.2 (Theorem 3.16 from [HM06]) Let $\mathcal{P}_t$ be a Markov semigroup on a
Polish space $X$ admitting two distinct ergodic invariant measures $\mu$ and $\nu$. If $\mathcal{P}_t$ has the
asymptotic strong Feller property, then $\mathrm{supp}(\mu) \cap \mathrm{supp}(\nu)$ is empty.

To better understand how the asymptotic strong Feller property can be used to connect
topological properties and ergodic properties of $\mathcal{P}_t$, we introduce the following form
of topological irreducibility.

Definition 1.3 We say that a Markov semigroup $\mathcal{P}_t$ is weakly topologically irreducible
if for all $u_1, u_2 \in \mathcal{H}$ there exists a $v \in \mathcal{H}$ so that for any open set $A$ containing $v$ there
exist $t_1, t_2 > 0$ with $\mathcal{P}_{t_i}(u_i, A) > 0$.

Also recall that $\mathcal{P}_t$ is said to be Feller if $\mathcal{P}_t\varphi$ is continuous whenever $\varphi$ is bounded and
continuous. We then have the following corollary to Theorem 1.2, whose proof is given
in Section 2.

Corollary 1.4 Any Markov semigroup $\mathcal{P}_t$ on a Polish space which is Feller, weakly
topologically irreducible and asymptotically strong Feller admits at most one invariant
probability measure.

The discussion of this section shows that unique ergodicity can be obtained for a
Markov semigroup by showing that:

1. It satisfies the asymptotic strong Feller property.

2. There exists an "accessible point" which must belong to the topological support
of every invariant probability measure.

^2 Here $B_\delta(u) = \{v : \|u - v\| < \delta\}$.

It turns out that if one furthermore has some control on the speed at which solutions
return to bounded regions of phase space, one can prove the existence of spectral gaps
in a weighted Wasserstein-1 metric [HM08].

The present article will mainly concentrate on the first point. This is because, by 
analogy with the finite-dimensional case, one can hope to find a clean and easy to verify 
condition along the lines of Hörmander's bracket condition that ensures a regularisation
property like the asymptotic strong Feller property. Concerning the accessibility
of points however, although one can usually use the Stroock-Varadhan support
theorem to translate this into a deterministic question of approximate controllability, it can
be a very hard problem even in finite dimensions. While geometric control theory
can give qualitative information about the set of reachable points [Jur97, AS04], the
verification of the existence of accessible points seems to rely in general on ad hoc 
considerations, even in apparently simple finite-dimensional problems. We will how- 
ever verify in Section liOl below that for the stochastic Ginzburg-Landau equation there 
exist accessible points under very weak conditions on the forcing. 

With this in mind, the aim of this article is to prove the following result: 

Theorem 1.5 Consider the setting of (1.1) on some Hilbert space $\mathcal{H}$ and define a
sequence of subsets of $\mathcal{H}$ recursively by $A_0 = \{g_j : j = 1, \ldots, d\}$ and

$$A_{k+1} = A_k \cup \{N_m(h_1, \ldots, h_m) : h_j \in A_k\} .$$

If the linear span of $A_\infty = \bigcup_{n > 0} A_n$ is dense in $\mathcal{H}$, then the Markov semigroup $\mathcal{P}_t$
associated to (1.1) has the asymptotic strong Feller property.

The precise formulation of Theorem 1.5 is given in Theorems 5.4 and 6.7 below.
Note that our general theorem is slightly stronger than Theorem 1.5 since it allows
one to consider arbitrary "non-constant" Lie brackets between the driving noises and the
drift, see (1.4) below. As further discussed in Section [131 or 13.1 1, $N_m(h_1, \ldots, h_m)$ is
proportional to $D_{h_1} \cdots D_{h_m} N(u)$, where $D_h$ is the Fréchet derivative in the direction $h$.
In turn, this is equal to the successive Lie bracket of $N$ with the vector fields $h_1$ to $h_m$.

Under the same assumptions as Theorem 1.5, the existence of densities for the
finite dimensional projections of $\mathcal{P}_t(x, \cdot)$ was proven in [BM07]. The smoothness of
these densities was also discussed in [BM07], but unfortunately there were two errors
in the proof which require us to give a close, but modified, argument in Section 6 and
Section 7 to prove the needed results.

The remainder of this section is devoted to a short discussion of the main techniques 
used in the proof of such a result and in particular on how to obtain a bound of the type 
(1.2) for a parabolic stochastic PDE.

1.2 A roadmap for the impatient 

Readers eager to get to the heart of this article but understandably reluctant to dig
into too many technicalities may want to finish reading Section 1, then jump directly
to Section 5 and read up to the end of Section 5.3 to get a good idea of how (1.2)
is obtained from bounds on the Malliavin matrix. Then they may want to go to the
beginning of Section 6 and read to the end of Section 6.4 to see how these bounds can
be obtained.

1.3 How to obtain a smoothing estimate 

A more technical overview of the technique will be given in Section IBT21 below. In a 
nutshell, our aim is to generalise the arguments from [HM06] and the type of Malliavin
calculus estimates first developed in [MP06] to a large class of semilinear parabolic
SPDEs with polynomial nonlinearities. Both previous works relied on the particular
structure of the Navier-Stokes equations. The technique of proof can be interpreted as
an "infinitesimal" version of techniques developed in [EMS01, KS00] and extended in
[BKL01, MY02, Mat02, Hai02, BM05] combined with detailed lower bounds on the
Malliavin covariance matrix of the solution.

In [EMS01] the idea was the following: take two distinct initial conditions $u_0$ and
$u_0'$ for (1.1) and a realisation $W$ for the driving noise. Try then to find a shift $v$
belonging to the Cameron-Martin space of the driving process and such that $\|u(t) - u'(t)\| \to 0$
as $t \to \infty$, where $u'$ is the solution to (1.1) driven by the shifted noise $W' = W + v$.
Girsanov's theorem then ensures that the two initial conditions induce equivalent
measures on the infinite future. This in turn implies the unique ergodicity of the system.
(See also [Mat08] for more details.)

The idea advocated in [HM06] is to consider an infinitesimal version of this
construction. Fix again an initial condition $u_0$ and Wiener trajectory $W$ but consider now
an infinitesimal perturbation $\xi$ to the initial condition instead of considering a second
initial condition at distance $\mathcal{O}(1)$. This produces an infinitesimal variation in the
solution $u_t$ given by its Fréchet derivative $D_\xi u_t$ with respect to $u_0$. Similarly to before,
one can then consider the "control problem" of finding an infinitesimal variation of the
Wiener process in a direction $h$ from the Cameron-Martin space which, for large times
$t$, compensates the effect of the variation $\xi$. Since the effect on $u_t$ of an infinitesimal
variation in the Wiener process is given by the Malliavin derivative of $u_t$ in the direction
$h$, denoted by $\mathcal{D}_h u_t$, the problem in this setting is to find an $h(\xi, W) \in L^2([0, \infty], \mathbb{R}^d)$
with

$$\mathbb{E}\|D_\xi u_t - \mathcal{D}_h u_t\| \to 0 \quad \text{as } t \to \infty , \qquad (1.3)$$

and such that the expected "cost" of $h$ is finite. Here, the Malliavin derivative $\mathcal{D}_h u_t$
is given by the derivative in $\varepsilon$ at $\varepsilon = 0$ of $u_t(W + \varepsilon v)$, with $v(t) = \int_0^t h(s)\,ds$.
If $h$ is adapted to the filtration generated by $W$, then the expected cost is simply
$\int_0^\infty \mathbb{E}\|h_s\|^2\,ds$. If it is not adapted, one must estimate directly $\limsup \mathbb{E}\|\int_0^t h_s\,dW_s\|$,
where the integral is a Skorokhod integral. As will be explained in detail in Section 5.2,
once one establishes (1.3) with a finite expected cost $h$, the crucial estimate given in
(1.2) (used to prove the asymptotic strong Feller property) follows by a fairly general
procedure.
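
The control problem (1.3) can be illustrated on a one-dimensional toy model, which is an assumption made purely for illustration and is not one of the equations treated in this article. For the scalar linear SDE $du = a\,u\,dt + dW$ with $a > 0$, one has $D_\xi u_t = e^{at}\xi$ and $\mathcal{D}_h u_t = \int_0^t e^{a(t-s)} h_s\,ds$, so the residual $\rho_t = D_\xi u_t - \mathcal{D}_h u_t$ solves $\dot\rho_t = a\rho_t - h_t$ with $\rho_0 = \xi$. The hypothetical feedback choice $h_t = (a+1)\rho_t$ gives $\rho_t = \xi e^{-t}$ and a finite cost $\int_0^\infty h_s^2\,ds = (a+1)^2\xi^2/2$. The sketch below checks this numerically with an Euler scheme.

```python
def transfer_of_variation(a=0.5, xi=1.0, T=10.0, dt=1e-3, gain=None):
    """Euler scheme for rho' = a*rho - h with linear feedback h = gain*rho.

    gain=None uses the stabilising choice gain = a + 1; gain=0.0 switches the
    control off. Returns the residual variation rho_T and the cost
    int_0^T h_s^2 ds accumulated by the control.
    """
    if gain is None:
        gain = a + 1.0
    rho, cost = xi, 0.0
    for _ in range(int(round(T / dt))):
        h = gain * rho
        cost += h * h * dt
        rho += (a * rho - h) * dt
    return rho, cost


rho_ctrl, cost_ctrl = transfer_of_variation()
rho_free, cost_free = transfer_of_variation(gain=0.0)
print("controlled:   rho_T =", rho_ctrl, " cost =", cost_ctrl)
print("uncontrolled: rho_T =", rho_free, " cost =", cost_free)
```

Without control the variation grows like $e^{at}$, whereas the controlled variation decays at a finite cost. In this toy case the control is adapted, which corresponds to the simpler situations described next rather than to the hypoelliptic setting.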

As this discussion makes clear, one of our main tasks will be to construct a shift h 
having the property (1.3). We will distinguish three cases of increasing generality (and
technical difficulty). In the first case, which will be referred to as strongly contracting 
(see Section 13. 1.1b . the linearised dynamics contracts pathwise without modification 
(all Lyapunov exponents are negative). Hence h can be taken to be identically zero. 
The next level of complication comes when the system possesses a number of
directions which are unstable on average. The simplest way to deal with this situation
is to assume that the complement of the span of the forced directions (the $g_k$'s) is
contracting on average. This was the case in [EMS01, KS00, BKL01, MY02, Mat02,
Hai02, BM05]. We refer to this as the "essentially elliptic" setting since the directions
essential to determine the system's long time behavior, the unstable directions, are
directly forced. This is a reflection of the maxim in dynamical systems that the long
time behavior is determined by the behavior in the unstable directions. Since the noise
affects all of these directions, it is not surprising that the system is uniquely ergodic,
see Section 4.5 of [HM06] for more details.

The last case (i.e. when the set of forced directions does not contain all of the 
unstable directions) is the main concern of the present paper. In this setting, we study
the interaction between the drift and the forced directions to understand precisely how
randomness spreads through the system. The condition ensuring that one can gain sufficient
control over the unstable directions requires that the $g_k$ together with a collection of
Lie brackets (or commutators) of the form

$$[F, g_k],\quad [[F, g_k], g_j],\quad [[F, g_k], F],\quad [[[F, g_k], g_j], g_l],\quad \ldots \qquad (1.4)$$

span all of the unstable directions. This condition will be described more precisely in
Section lo\2l below. In finite dimensions, when this collection of Lie brackets spans the 
entire tangent space at every point, the system is said to satisfy the "weak Hörmander"
condition. When this assumption holds for the unstable directions (along with some
additional technical assumptions), we can ensure that the noise spreads sufficiently to
the unstable directions to find an $h$ capable of counteracting the expansion in the unstable
directions and allowing one to prove (1.3) with a cost whose expectation is finite.

We will see however that the control h used will not be adapted to the filtration 
generated by the increments of the driving Wiener process, thus causing a number of 
technical difficulties. This stems from the seemingly fundamental fact that because we 
need some of the "bracketed directions" (1.4) in order to control the dynamic, we need
to work on a time scale longer than the instantaneous one. In the "essentially elliptic" 
setting, on the other hand, we were able to work instantaneously and hence obtain an 
adapted control h and avoid this technicality. 

1.4 The role of the Malliavin matrix 

Since the Malliavin calculus was developed in the 1970s and 1980s mainly to give a
probabilistic proof of Hörmander's "sum of squares" theorem under the type of bracket
conditions we consider, it is not surprising that the Malliavin matrix $\mathcal{M}_t = \mathcal{D}u_t\,\mathcal{D}u_t^*$
plays a major role in the construction of the variation $h$ in the "weak Hörmander"
setting. A rapid introduction to Malliavin calculus in our setting is given in Section 4. In
finite dimensions, the key to the proof of existence and smoothness of densities is the
finiteness of moments of the inverse of the Malliavin matrix. This estimate
encapsulates the fact that the noise affects all of the directions with a controllable cost. In infinite
dimensions, while it is possible to prove that the Malliavin matrix is almost surely
non-degenerate, it seems very difficult to characterise its range. (With the exception of the
linear case [DPZ96]. See also [DPEZ95, FM95, Cer99, EH01] for situations where the
Malliavin matrix can be shown to be invertible on the range of the Jacobian.) However,
in light of the preceding section, it is not surprising that we essentially only need the
invertibility of the Malliavin matrix on the space spanned by the unstable directions,
which is finite dimensional in all of our examples. More precisely, we need information
about the likelihood of eigenvectors with sizable projections in the unstable directions
to have small eigenvalues. Given a projection $\Pi$ whose range includes the unstable
directions we will show that the Malliavin matrix $\mathcal{M}_t$ satisfies an estimate of the form

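$$\mathbf{P}\Big(\inf_{\varphi \in \mathcal{H} :\, \|\Pi\varphi\| \ge \frac{1}{2}\|\varphi\|} \frac{\langle \mathcal{M}_t\varphi, \varphi\rangle}{\|\varphi\|^2} < \varepsilon\Big) = o(\varepsilon^p) \qquad (1.5)$$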

for all $p > 1$. Heuristically, this means we have control of the probabilistic cost to
create motion in all of the directions in the range of $\Pi$ without causing too large an effect in
the complementary directions. We will pair such an estimate with the assumption that
the remaining directions are stable in that the Jacobian (the linearization of the SPDE
about the trajectory $u_t$) satisfies a contractive estimate for the directions perpendicular
to the range of $\Pi$. Together, these assumptions will let us build an infinitesimal Wiener
shift $h$ which approximately compensates for the component of the infinitesimal shift
caused by the variation in the initial condition in the unstable directions. Once the
variation in the unstable directions has been decreased, the assumed contraction in the
stable directions will ensure that the variation in the stable directions will also decrease
until it is commensurate in size with the remaining variation in the unstable directions.
Iterating this argument we can drive the variation to zero.

Note that one feature of the bound (1.5) is that all the norms and scalar products
appearing there are the same. This is a strengthening of the result from [MP06] which
fixes an error in [HM06], see Section 6 for more details.

The basic structure of the sections on Malliavin calculus follows the presentation in
[BM07] which built on the ideas and techniques from [MP06, Oco88]. As in all three
of these works, as well as the present article, the time reversed adjoint linearization is
used to develop an alternative representation of the Malliavin covariance matrix. In
[Oco88], only the case of linear drift and linear multiplicative noise was considered.
In [MP06], a nonlinear equation with a quadratic nonlinearity and additive noise was
considered. In [BM07], the structure was abstracted and generalized so that it was
amenable to general polynomial nonlinearities. We follow that structure and basic
line of argument here while strengthening the estimates and correcting some important
errors.

Most existing bounds on the inverse of the Malliavin matrix in a hypoelliptic
situation make use of some version of Norris' lemma [KS84, KS85a, Nor86, MP06, BH07].
In its form taken from [Nor86], it states that if a semimartingale $Z(t)$ is small and
one has some control on the roughness of both its bounded variation part $A(t)$ and its
quadratic variation process $Q(t)$, then both $A$ and $Q$ taken separately must be small.
While the versions of Norris' lemma given in [MP06, BM07, BH07] are not precisely
of this form (in both cases, one cannot reduce the problem to semimartingales, either
because of the infinite-dimensionality of the problem or because one considers SDEs
driven by processes that are not Wiener processes), they have the same flavour in that
they state that if a process is composed of a "regular" part and an "irregular" part,
then these two parts cannot cancel each other. This harkens back to the more explicit
estimates based on estimates of modulus of continuity found in [KS85b, Str83]. The
replacement for Norris' lemma used in the present work covers the case where one
is given a finite collection of Wiener processes $W_j$ and a collection of not necessarily
adapted Lipschitz continuous processes $A_\alpha(t)$ (for $\alpha$ a multi-index) and considers the
process

$$Z(t) = A_\emptyset(t) + \sum_{\ell=1}^{M} \sum_{|\alpha| = \ell} A_\alpha(t)\, W_{\alpha_1}(t) \cdots W_{\alpha_\ell}(t) .$$

It then states that if $Z$ is small, this implies that all of the $A_\alpha$'s with $|\alpha| \le M$ are small.
For a precise formulation, see Section 7 below. It is in order to be able to use this
result that we are restricted to equations with polynomial nonlinearities. This result on
Wiener polynomials is a descendant of the result proven in [MP06] for polynomials of
degree one. In [BM07], a result for general Wiener polynomials was also proven. It
was shown there that if $Z(t) = 0$ for $t \in [0, T]$ then $A_\alpha(t) = 0$ for $t \in [0, T]$.
This was used to prove the existence of a density for the finite dimensional projections
of the transition semigroup. In the same article, the same quantitative version of this
result as proven in the present article was claimed. Unfortunately, there was an error in
the proof. Nonetheless the techniques used here are built on and refine those developed
in [BM07].
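
As a toy illustration of why a Wiener polynomial determines its coefficients, the following sketch treats the much simpler case of a single Wiener process and constant coefficients, $Z(t) = A_0 + A_1 W(t) + A_2 W(t)^2$. This is an assumption made for illustration only: the results discussed above concern time-dependent, possibly non-adapted coefficients and give quantitative bounds. On a sampled path the functions $1$, $W$, $W^2$ are linearly independent, so the coefficients can be read off from $Z$ by least squares. All function names here are hypothetical.

```python
import math
import random


def brownian_path(n_steps, T=1.0, seed=0):
    """Sample W on a uniform grid of [0, T] using independent Gaussian increments."""
    rng = random.Random(seed)
    dt = T / n_steps
    w = [0.0]
    for _ in range(n_steps):
        w.append(w[-1] + rng.gauss(0.0, math.sqrt(dt)))
    return w


def solve_linear(M, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(M)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def recover_coefficients(z, w, degree=2):
    """Least-squares fit of z(t) = sum_k A_k * W(t)**k on one sampled path."""
    basis = [[wi ** k for k in range(degree + 1)] for wi in w]
    gram = [[sum(row[i] * row[j] for row in basis) for j in range(degree + 1)]
            for i in range(degree + 1)]
    rhs = [sum(row[i] * zi for row, zi in zip(basis, z))
           for i in range(degree + 1)]
    return solve_linear(gram, rhs)


w = brownian_path(1000)
true_coeffs = [0.3, -1.2, 0.7]
z = [sum(a * wi ** k for k, a in enumerate(true_coeffs)) for wi in w]
print(recover_coefficients(z, w))
```

In particular, if $Z$ vanishes identically along the path, the recovered coefficients vanish as well, which mirrors the qualitative statement recalled above.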

1.5 Satisfying the Hormander-like assumption 

At first glance the condition that the collection of functions given in (1.4) is dense
in our state space may seem hopelessly strong. However, we will see that it is
often not difficult to ensure. Recall that the nonlinearity $N$ is a polynomial of order
$m$, and hence, it has a leading order part which is $m$-homogeneous. We can view
this leading order part as a symmetric $m$-linear map which we will denote by $N_m$.
Then, at least formally, the Lie bracket of $N$ with $m$ constant vector fields is
proportional to $N_m$, evaluated at the constant vector fields, that is $N_m(h_1, \ldots, h_m) \propto
[\cdots[[F, h_1], h_2], \cdots, h_m]$, which is again a constant vector field. While the collection
of vector fields generated by brackets of this form is only a subset of the possible
brackets, it is often sufficient to obtain a set of dense vector fields. For example, if
$N(u) = u - u^3$ then $N_3(v_1, v_2, v_3) = v_1 v_2 v_3$ and if the forced directions $\{g_1, \ldots, g_d\}$
are $C^\infty$ then $N_3(h_1, h_2, h_3) \in C^\infty$ for $h_i \in \{g_1, \ldots, g_d\}$. As observed in [BM07], to
obtain a simple sufficient criterion for the brackets to be dense, suppose that $A \subset C^\infty$
is a finite set of functions that generates, as a multiplicative algebra, a dense subset
of the phase space. Then, if the forced modes $A_0 = \{g_1, \ldots, g_d\}$ contain the set
$\{h, h\bar h : h, \bar h \in A\}$, the set $A_\infty$ constructed as in Theorem 1.5 will span a dense subset
of phase space.
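
The recursion of Theorem 1.5 can be followed by hand in a simplified model, sketched below. As an illustrative assumption, the forced directions are taken to be complex exponentials $e_k(x) = e^{ikx}$ on the circle and the leading order part is the cubic product $N_3(h_1, h_2, h_3) = h_1 h_2 h_3$, so that $N_3(e_{k_1}, e_{k_2}, e_{k_3}) = e_{k_1 + k_2 + k_3}$. Each $A_n$ is then described by a set of integer wavenumbers. The function name is hypothetical.

```python
from itertools import combinations_with_replacement


def bracket_closure(forced_modes, degree=3, steps=2):
    """Iterate A_{k+1} = A_k U {N_m(h_1, ..., h_m) : h_j in A_k} on wavenumbers.

    Each exponential e^{ikx} is represented by the integer k, and the m-fold
    product of exponentials corresponds to the sum of their wavenumbers.
    """
    modes = set(forced_modes)
    for _ in range(steps):
        products = {sum(combo)
                    for combo in combinations_with_replacement(sorted(modes), degree)}
        modes |= products
    return modes


# Forcing the constant mode together with e^{ix} and e^{-ix}.
print(sorted(bracket_closure({-1, 0, 1}, steps=2)))

# Forcing only e^{ix} and e^{-ix}: a product of three odd modes is odd.
print(sorted(bracket_closure({-1, 1}, steps=2)))
```

With the constant mode included, the wavenumbers fill out growing symmetric intervals, so the span of $A_\infty$ contains every trigonometric polynomial. Without it only odd wavenumbers are ever produced, and the density condition fails in this model.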

1.6 Probabilistic and dynamical view of smoothing 

Implicit in (1.3) is the "transfer of variation" from the initial condition to the Wiener
path. This is the heart of "probabilistic smoothing" and the source of ergodicity when it 
is fundamentally probabilistic in nature. The unique ergodicity of a dynamical system 
is equivalent to the fact that it "forgets its initial condition" with time. The two terms 
appearing on the right-hand side of (1.2) represent two different sources of this loss of
memory. The first is due to the randomness entering the system. This causes nearby 
points to end up at the same point at a later time because they are following different 
noise realisations. The fact that different stochastic trajectories can arrive at the same 
point and hence lead to a loss of information is the hallmark of diffusions and unique 
ergodicity due to randomness. From the coupling point of view, since different realiza- 
tions lead to the same point yet start at different initial conditions, one can couple in 
finite time. 

The second term in (1.2) is due to "dynamical smoothing" and is one of the sources
of unique ergodicity in deterministic contractive dynamical systems. If two trajectories 
converge towards each other over time then the level of precision needed to determine 
which initial condition corresponds to which trajectory also increases with time. This 
is another type of information loss and equally leads to unique ergodicity. However, 
unlike "probabilistic smoothing", the information loss is never complete at any finite 
time. Another manifestation of this fact is that the system never couples in finite time,
only at infinity. In Section lB.l.ll about the strongly dissipative setting, the case of pure 
dynamical smoothing is considered. In this case one has (1.2) with only the second
term present. When both terms exist, one has a mixture of probabilistic and dynamical
smoothing leading to a loss of information about the initial condition. In Section 2.2
of [HM08] it is shown how (1.2) can be used to construct a coupling in which nearby
initial conditions converge to each other at time infinity. The current article takes a
"forward in time" perspective, while [EMS01, BM05] pull the initial condition back
to minus infinity. The two points of view are essentially equivalent. One advantage to
moving forward in time is that it makes proving a spectral gap for the dynamic more
natural. We provide such an estimate in Section lOl for the stochastic Ginzburg-Landau 
equation. 

1.7 Structure of the article 

The structure of this article is as follows. In Section 2 we give a few abstract
ergodic results both proving the results in the introduction and expanding upon them. In
Section 3 we introduce the functional analytic setup in which our problem will be
formulated. This setup is based on Assumption A.1 which ensures that all the operations
that will be made later (differentiation with respect to initial condition, representation
for the Malliavin derivative, etc.) are well-behaved. Section 4 is a follow-up section
where we define the Malliavin matrix and obtain some simple upper bounds on it. We
then introduce some additional assumptions in Section 6.1 which ensure that we have
suitable control on the size of the solutions and on the growth rate of its Jacobian.

In Section 5 we obtain the asymptotic strong Feller property under a partial
invertibility assumption on the Malliavin matrix and some additional partial contractivity
assumptions on the Jacobian. Section 6.3 then contains the proof that assumptions on
the Malliavin matrix made in Section 5 are justified and can be verified for a large class
of equations under a Hörmander-type condition. The main ingredient of this proof, a
lower bound on Wiener polynomials, is proved in Section 7. Finally, we conclude in
Section 8 with two examples for which our conditions can be verified. We consider
the Navier-Stokes equations on the two-dimensional sphere and a general
reaction-diffusion equation in three or fewer dimensions.

2 Abstract ergodic results 

We now expand upon the abstract ergodic theorems mentioned in the introduction
which build on the asymptotic strong Feller property. We begin by giving the proof
of Corollary 1.4 from the introduction and then give a slightly different result (but with
the same flavour) which will be useful in the investigation of the Ginzburg-Landau
equation in Section liPl. Throughout this section, $\mathcal{P}_t$ will be a Markov semigroup on a
Hilbert space $\mathcal{H}$ with norm $\|\cdot\|$.

Proof of Corollary 1.4. Since $\mathcal{P}_t$ is Feller, we know that for any $u \in \mathcal{H}$ and open set $A$
with $\mathcal{P}_t(u, A) > 0$ there exists an open set $B$ containing $u$ so that

$$\inf_{z \in B} \mathcal{P}_t(z, A) > 0 .$$

Combining this fact with the weak topological irreducibility, we deduce that for all
$u_1, u_2 \in \mathcal{H}$ there exists $v \in \mathcal{H}$ so that for any $\varepsilon > 0$ there exist $\delta, t_1, t_2 > 0$ with

$$\inf_{z \in B_\delta(u_i)} \mathcal{P}_{t_i}(z, B_\varepsilon(v)) > 0 \qquad (2.1)$$

for $i = 1, 2$.

Now assume by contradiction that we can find two distinct invariant probability
measures $\mu_1$ and $\mu_2$ for $\mathcal{P}_t$. Since any invariant probability measure can be written as a
convex combination of ergodic measures, we can take them to be ergodic without loss
of generality. Picking $u_i \in \mathrm{supp}(\mu_i)$, by assumption there exists a $v$ so that for any
$\varepsilon > 0$ there exist $t_1$, $t_2$ and $\delta > 0$ so that (2.1) holds. Since $u_i \in \mathrm{supp}(\mu_i)$ we know
that $\mu_i(B_\delta(u_i)) > 0$ and hence

$$\mu_i(B_\varepsilon(v)) = \int_{\mathcal{H}} \mathcal{P}_{t_i}(z, B_\varepsilon(v))\,\mu_i(dz) \ge \int_{B_\delta(u_i)} \mathcal{P}_{t_i}(z, B_\varepsilon(v))\,\mu_i(dz)$$
$$\ge \mu_i(B_\delta(u_i)) \inf_{z \in B_\delta(u_i)} \mathcal{P}_{t_i}(z, B_\varepsilon(v)) > 0 .$$

Since $\varepsilon$ was arbitrary, this shows that $v \in \mathrm{supp}(\mu_1) \cap \mathrm{supp}(\mu_2)$, which by Theorem 1.2
gives the required contradiction. □

We now give a more quantitative version of Theorem 1.2. It shows that if one has
access to the quantitative information embodied in (1.2), as opposed to only the asymptotic
strong Feller property, then not only are the supports of any two ergodic invariant
measures disjoint but they are actually separated by a distance which is directly related
to the function $C$ from (1.2).

Theorem 2.1 Let $\{\mathcal{P}_t\}$ be a Markov semigroup on a separable Hilbert space $\mathcal{H}$ such
that (1.2) holds for some non-decreasing function $C$. Let $\mu_1$ and $\mu_2$ be two distinct
ergodic invariant probability measures for $\mathcal{P}_t$. Then, the bound $\|u_1 - u_2\| \ge
1/C(\|u_1\| \vee \|u_2\|)$ holds for any pair of points $(u_1, u_2)$ with $u_i \in \operatorname{supp} \mu_i$.

Proof. The proof is a variation on the proof of Theorem 3.16 in [HM06]. We begin
by defining for $u, v \in \mathcal{H}$ the distance $d_n(u, v) = 1 \wedge (\delta_n^{-1/2}\|u - v\|)$ where $\delta_n$ is the
sequence of positive numbers from (1.2). As shown in the proof of Theorem 3.12 in
[HM06], one has

$$ d_n(\mathcal{P}_{t_n}^*\delta_{u_1}, \mathcal{P}_{t_n}^*\delta_{u_2}) \le \|u_1 - u_2\|\, C(\|u_1\| \vee \|u_2\|)\,(1 + \sqrt{\delta_n}) \tag{2.2} $$

where $d_n$ is the 1-Wasserstein distance^3 on probability measures induced by the metric
$d_n$. Observe that for all $u, v \in \mathcal{H}$, $d_n(u, v) \le 1$ and $\lim_{n \to \infty} d_n(u, v) = \mathbf{1}_{\{u \ne v\}}$. Hence by
Lemma 3.4 of [HM06], for any probability measures $\mu$ and $\nu$, $\lim_{n \to \infty} d_n(\mu, \nu) =
d_{TV}(\mu, \nu)$ where $d_{TV}(\mu, \nu)$ is the total variation distance^4.

Let $\mu_1$ and $\mu_2$ be two ergodic invariant measures with $\mu_1 \ne \mu_2$. By Birkhoff's
ergodic theorem, we know that they are mutually singular and thus $d_{TV}(\mu_1, \mu_2) = 1$.
We now proceed by contradiction. We assume that there exists a pair of points $(u_1, u_2)$
with $u_i \in \operatorname{supp}(\mu_i)$ such that $\|u_1 - u_2\| < 1/C(\|u_1\| \vee \|u_2\|)$. We will conclude by
showing that this implies that $d_{TV}(\mu_1, \mu_2) < 1$ and hence $\mu_1$ and $\mu_2$ are not singular,
which will be a contradiction.
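
The bound (2.2) can be traced back to the gradient estimate (1.2) by a short computation. The sketch below assumes, following [HM06], that (1.2) has the form $\|D\mathcal{P}_{t_n}\varphi(u)\| \le C(\|u\|)(\|\varphi\|_\infty + \delta_n \|D\varphi\|_\infty)$, and it treats the test function $\varphi$ as differentiable. Both points are assumptions of this illustration rather than statements quoted from the text.

```latex
% Assumption: \varphi is 1-Lipschitz with respect to d_n(u,v) = 1 \wedge \delta_n^{-1/2}\|u-v\|.
% Then, after subtracting a constant, \|\varphi\|_\infty \le 1 and \|D\varphi\|_\infty \le \delta_n^{-1/2}.
% The segment joining u_1 and u_2 lies in the ball of radius \|u_1\| \vee \|u_2\|, and C is non-decreasing.
\begin{align*}
|\mathcal{P}_{t_n}\varphi(u_1) - \mathcal{P}_{t_n}\varphi(u_2)|
 &\le \|u_1 - u_2\| \sup_{\|u\| \le \|u_1\| \vee \|u_2\|} \|D\mathcal{P}_{t_n}\varphi(u)\| \\
 &\le \|u_1 - u_2\|\, C(\|u_1\| \vee \|u_2\|)\,\bigl(1 + \delta_n \cdot \delta_n^{-1/2}\bigr) \\
 &= \|u_1 - u_2\|\, C(\|u_1\| \vee \|u_2\|)\,\bigl(1 + \sqrt{\delta_n}\bigr).
\end{align*}
```

Taking the supremum over all such $\varphi$ gives a bound of the form (2.2) on the Wasserstein distance between the two transition probabilities.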

Our assumption on $u_1$ and $u_2$ implies that there exists a set $A$ containing $u_1$ and
$u_2$ such that $\alpha = \min(\mu_1(A), \mu_2(A)) > 0$ and $\beta = \sup\{\|u - v\| : u, v \in A\}\, C(\|u_1\| \vee
\|u_2\|) < 1$. As shown in the proof of Theorem 3.16 in [HM06], for any $n$ one has

$$ d_n(\mu_1, \mu_2) \le 1 - \alpha\Bigl(1 - \sup_{v_i \in A} d_n(\mathcal{P}_{t_n}^*\delta_{v_1}, \mathcal{P}_{t_n}^*\delta_{v_2})\Bigr) \le 1 - \alpha\bigl(1 - \beta(1 + \sqrt{\delta_n})\bigr) , $$

where the last inequality used the bound in equation (2.2). Taking $n \to \infty$ produces
$d_{TV}(\mu_1, \mu_2) \le 1 - \alpha(1 - \beta)$. Since $\alpha \in (0, 1)$ and $\beta < 1$ we conclude that
$d_{TV}(\mu_1, \mu_2) < 1$. This implies a contradiction since $\mu_1$ and $\mu_2$ are mutually singular
measures. □

3 $d_n(\nu_1, \nu_2) = \sup \bigl(\int \varphi\, d\nu_1 - \int \varphi\, d\nu_2\bigr)$ where the supremum runs over functions $\varphi\colon \mathcal{H} \to \mathbf{R}$ which have
Lipschitz constant one with respect to the metric $d_n$.

4 Different communities normalize the total variation distance differently. Our $d_{TV}$ is half of the total
variation distance as defined typically in analysis. The definition we use is common in probability as it is
normalised in such a way that $d_{TV}(\mu, \nu) = 1$ for mutually singular probability measures.



Paired with this stronger version of Theorem 1.2, we have the following version
of Corollary 1.4, which uses an even weaker form of irreducibility. This is a general
principle: if one has a stronger form of the asymptotic strong Feller property, one
can prove unique ergodicity under a weaker form of topological irreducibility. The
form of irreducibility used in Corollary 2.2 allows the point where two trajectories
approach to move about, depending on the degree of closeness required. To prove
unique ergodicity, the trade-off is that one needs some control of the "smoothing rate"
implied by asymptotic strong Feller at different points in phase space.

Corollary 2.2 Let $\{\mathcal{P}_t\}$ be as in Theorem 2.1. Suppose that, for every $R_0 > 0$, it is
possible to find $R > 0$ and $T > 0$ such that, for every $\varepsilon > 0$, there exists a point $v$ with
$\|v\| \le R$ such that $\mathcal{P}_T(u, B_\varepsilon(v)) > 0$ for every $\|u\| \le R_0$. Then, $\mathcal{P}_t$ can have at most
one invariant probability measure.

Proof. Assume by contradiction that there exist two ergodic invariant probability measures
$\mu_1$ and $\mu_2$ for $\mathcal{P}_t$. Then, choosing $R_0$ large enough so that the open ball of radius
$R_0$ intersects the supports of both $\mu_1$ and $\mu_2$, it follows from the assumption, by similar
reasoning as in the proof of Corollary 1.4, that $\operatorname{supp} \mu_i$ intersects $B_\varepsilon(v)$. Since
$\|v\|$ is bounded uniformly in $\varepsilon$, making $\varepsilon$ sufficiently small yields a contradiction with
Theorem 2.1 above. □

3 Functional analytic setup 

In this section we introduce the basic functional analytic set-up for the rest of the paper.
We will develop the needed existence and regularity theory to place the remainder of
the paper on a firm foundation. We consider semilinear stochastic evolution equations
with additive noise in a Hilbert space $\mathcal{H}$ of the form

$$ du = -Lu\,dt + N(u)\,dt + \sum_{k=1}^{d} g_k\,dW_k(t) , \qquad u_0 \in \mathcal{H} . \tag{3.1} $$

Here, the $W_k$ are independent real-valued standard Wiener processes over some probability
space $(\Omega, \mathbf{P}, \mathcal{F})$. Our main standing assumption throughout this article is that $L$
generates an analytic semigroup and that the nonlinearity $N$ results in a loss of regularity
of $a$ powers of $L$ for some $a < 1$. More precisely, we have:

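Assumption A.1 There exists $a \in [0, 1)$ and $\gamma_*, \beta_* > -a$ (either of them possibly
infinite) with $\gamma_* + \beta_* > -1$ such that:

1. The operator $L\colon \mathcal{D}(L) \to \mathcal{H}$ is selfadjoint and satisfies $\langle u, Lu\rangle \ge \|u\|^2$. We
denote by $\mathcal{H}_\alpha$, $\alpha \in \mathbf{R}$, the associated interpolation spaces (i.e. $\mathcal{H}_\alpha$ with $\alpha > 0$ is
the domain of $L^\alpha$ endowed with the graph norm and $\mathcal{H}_{-\alpha}$ is its dual with respect
to the pairing in $\mathcal{H}$). Furthermore, $\mathcal{H}_\infty$ is the Fréchet space $\mathcal{H}_\infty = \bigcap_{\alpha > 0} \mathcal{H}_\alpha$
and $\mathcal{H}_{-\infty}$ is its dual.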



2. There exists $n \ge 1$ such that the nonlinearity $N$ belongs to $\mathrm{Poly}^n(\mathcal{H}_{\gamma+a}, \mathcal{H}_\gamma)$ for
every $\gamma \in [-a, \gamma_*)$ (see the definition of Poly in Section 3.1 below). In particular,
from the definition of $\mathrm{Poly}(\mathcal{H}_{\gamma+a}, \mathcal{H}_\gamma)$, it follows that it is continuous from $\mathcal{H}_\infty$
to $\mathcal{H}_\infty$.

3. For every $\beta \in [-a, \beta_*)$ there exists $\gamma \in [0, \gamma_* + 1)$ such that the adjoint (in $\mathcal{H}$)
$DN^*(u)$ of the derivative $DN$ of $N$ at $u$ (see again the definition in Section 3.1
below) can be extended to a continuous map from $\mathcal{H}_\gamma$ to $\mathcal{L}(\mathcal{H}_{\beta+a}, \mathcal{H}_\beta)$.

4. One has $g_k \in \mathcal{H}_{\gamma_*+1}$ for every $k$.

Remark 3.1 If $\gamma_* > 0$, then the range $\beta \in [-a, 0]$ for Assumption A.1.3 follows
directly from Assumption A.1.2, since A.1.3 simply states that for $u \in \mathcal{H}_\gamma$, $DN(u)$ is
a continuous linear map from $\mathcal{H}_{-\beta}$ to $\mathcal{H}_{-\beta-a}$.

Remark 3.2 The assumption $\langle u, Lu\rangle \ge \|u\|^2$ is made only for convenience so that
$L^\gamma$ is well-defined as a positive selfadjoint operator for every $\gamma \in \mathbf{R}$. It can always be
realized by subtracting a suitable constant from $L$ and adding it to $N$.

Similarly, non-selfadjoint linear operators are allowed if the antisymmetric part
is sufficiently "dominated" by the symmetric part, since one can then consider the
antisymmetric part as part of the nonlinearity $N$.

Remark 3.3 It follows directly from the Calderón-Lions interpolation theorem [RS80,
Appendix to IX.4] that if $N \in \mathrm{Poly}(\mathcal{H}, \mathcal{H}_{-a}) \cap \mathrm{Poly}(\mathcal{H}_{\gamma_*+a}, \mathcal{H}_{\gamma_*})$ for some $\gamma_* > -a$,
then $N \in \mathrm{Poly}(\mathcal{H}_{\gamma+a}, \mathcal{H}_\gamma)$ for every $\gamma \in [-a, \gamma_*]$. This can be seen by interpreting $N$
as a sum of linear maps from $\mathcal{H}_{\gamma+a}^{\otimes n}$ to $\mathcal{H}_\gamma$ for suitable values of $n$.

It will be convenient in the sequel to define $F$ by

$$ F(u) = -Lu + N(u) . \tag{3.2} $$

Note that $F$ is in $\mathrm{Poly}^n(\mathcal{H}_{\gamma+1}, \mathcal{H}_\gamma)$ for every $\gamma \in [-1, \gamma_*)$. We also define a linear
operator $G\colon \mathbf{R}^d \to \mathcal{H}_\infty$ by

$$ Gv = \sum_{k=1}^{d} v_k g_k , $$

for $v = (v_1, \ldots, v_d) \in \mathbf{R}^d$. With these notations, we will sometimes rewrite (3.1) as

$$ du = F(u)\,dt + G\,dW(t) , \qquad u_0 \in \mathcal{H} , \tag{3.3} $$

for $W = (W_1, \ldots, W_d)$ a standard $d$-dimensional Wiener process.
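
To make the splitting $F = -L + N$ concrete, here is a hedged illustration with a cubic reaction-diffusion equation. The specific equation, the choice $L = 1 - \Delta$ and the boundary conditions are assumptions made for this sketch and are not taken from the text; the sketch does not verify the remaining parts of Assumption A.1.

```latex
% Hypothetical example: du = (\Delta u + u - u^3)\,dt + \sum_k g_k\,dW_k(t)
% on a bounded domain with periodic or Dirichlet boundary conditions.
\begin{align*}
L &= 1 - \Delta, & N(u) &= 2u - u^3, \\
F(u) &= -Lu + N(u) = \Delta u + u - u^3, &
\langle u, Lu\rangle &= \|u\|^2 + \|\nabla u\|^2 \ge \|u\|^2 .
\end{align*}
% The constant 1 has been moved between L and N in the spirit of Remark 3.2,
% and N is a polynomial of degree n = 3.
```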
3.1 Polynomials 

We now describe in what sense we mean that $N$ is a "polynomial" vector field. Given
a Fréchet space $X$, we denote by $\mathcal{L}^n_s(X)$ the space of continuous symmetric $n$-linear
maps from $X$ to itself. We also denote by $\mathcal{L}(X, Y)$ the space of continuous linear maps
from $X$ to $Y$. For the sake of brevity, we will make use of the two equivalent notations
$P(u)$ and $P(u^{\otimes n})$ for $P \in \mathcal{L}^n_s(X)$.

Given $Q \in \mathcal{L}^k_s$, its derivative is given by the following $(k-1)$-linear map from $X$ to
$\mathcal{L}(X, X)$:

$$ DQ(u)v = kQ(u^{\otimes (k-1)} \otimes v) . $$
We will also use the notation $DQ^*\colon X \to \mathcal{L}(X', X')$ for the dual map given by

$$ \langle w, DQ^*(u)v\rangle = \langle v, DQ(u)w\rangle = k\langle v, Q(u^{\otimes (k-1)} \otimes w)\rangle . $$

Given $P \in \mathcal{L}^k_s$ and $Q \in \mathcal{L}^\ell_s$ we define the derivative $DQ\,P$ of $Q$ in the direction $P$
as a continuous map from $X \times X$ to $X$ by

$$ DQ(u)P(v) = \ell\, Q(u^{\otimes (\ell-1)} \otimes P(v)) . $$

Note that by polarisation, $u \mapsto DQ(u)P(u)$ uniquely defines an element of $\mathcal{L}^{k+\ell-1}_s$.
This allows us to define a "Lie bracket" $[P, Q] \in \mathcal{L}^{k+\ell-1}_s$ between $P$ and $Q$ by

$$ [P, Q](u) = DQ(u)P(u) - DP(u)Q(u) . $$

We also define $\mathrm{Poly}^n(X)$ as the set of continuous maps $P\colon X \to X$ of the form

$$ P(u) = \sum_{k=0}^{n} P^{(k)}(u) , $$

with $P^{(k)} \in \mathcal{L}^k_s(X)$ (here $\mathcal{L}^0_s(X)$ is the space of constant maps and can be identified
with $X$). We also set $\mathrm{Poly}(X) = \bigcup_{n \ge 0} \mathrm{Poly}^n(X)$. The Lie bracket defined above
extends to a map from $\mathrm{Poly}(X) \times \mathrm{Poly}(X) \to \mathrm{Poly}(X)$ by linearity.
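
As a small worked illustration of these definitions, consider a quadratic monomial $Q(u) = B(u, u)$ with $B$ a symmetric bilinear map, a constant vector field $g$, and a linear map $A$. The names $B$, $g$ and $A$ are hypothetical and introduced only for this sketch; the computation uses the bracket convention $[P, Q](u) = DQ(u)P(u) - DP(u)Q(u)$ stated above.

```latex
% Derivatives: DQ(u)v = 2B(u,v) (case k = 2), Dg(u) = 0 (case k = 0), DA(u)v = Av (case k = 1).
\begin{align*}
[g, Q](u) &= DQ(u)\,g - Dg(u)\,Q(u) = 2B(u, g), \\
[A, Q](u) &= DQ(u)\,Au - DA(u)\,Q(u) = 2B(u, Au) - A\,B(u, u).
\end{align*}
% Degrees: [g,Q] is linear (0 + 2 - 1 = 1) and [A,Q] is quadratic (1 + 2 - 1 = 2),
% in agreement with the degree count k + \ell - 1.
```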

3.1.1 Polynomials over H 

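We now specialize to polynomials over $\mathcal{H}$. We begin by choosing $X$ equal to the
Fréchet space $\mathcal{H}_\infty$, the intersection of $\mathcal{H}_a$ over all $a > 0$. Next we define the space
$\mathrm{Poly}(\mathcal{H}_a, \mathcal{H}_b) \subset \mathrm{Poly}(\mathcal{H}_\infty)$ as the set of polynomials $P \in \mathrm{Poly}(\mathcal{H}_\infty)$ such that there
exists a continuous map $\hat P\colon \mathcal{H}_a \to \mathcal{H}_b$ with $\hat P(u) = P(u)$ for all $u \in \mathcal{H}_\infty$. Note
that in general (unlike $\mathrm{Poly}(\mathcal{H}_\infty)$), $P, Q \in \mathrm{Poly}(\mathcal{H}_a, \mathcal{H}_b)$ does not necessarily imply
$[P, Q] \in \mathrm{Poly}(\mathcal{H}_a, \mathcal{H}_b)$. We will make an abuse of notation and use the same symbol
for both $P$ and $\hat P$ in the sequel.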

3.1.2 Taylor expansions and Lie brackets 

We now consider the Taylor expansion of a polynomial $Q$ in a direction $g$ belonging to
$\operatorname{span}\{g_1, \ldots, g_d\} \subset \mathcal{H}_{\gamma_*+1}$. Fix $Q \in \mathrm{Poly}^m(\mathcal{H}_\gamma, \mathcal{H}_\beta)$ for some $\gamma \le \gamma_* + 1$ and any
$\beta \in \mathbf{R}$. For $v \in \mathcal{H}_\gamma$ and $w = (w_1, \ldots, w_d) \in \mathbf{R}^d$, observe that there exist polynomials
$Q_\alpha$ such that

$$ Q\Bigl(v + \sum_{k=1}^{d} g_k w_k\Bigr) = \sum_{\alpha} Q_\alpha(v)\, w_\alpha , \qquad w_\alpha = w_{\alpha_1} \cdots w_{\alpha_\ell} , \tag{3.4} $$

where the summation runs over all multi-indices $\alpha = (\alpha_1, \ldots, \alpha_\ell)$, $\ell \ge 0$, with
values in the index set $\{1, \ldots, d\}$. It can be checked that the polynomials $Q_\alpha \in
\mathrm{Poly}^{m-|\alpha|}(\mathcal{H}_\gamma, \mathcal{H}_\beta)$ are given by the formula

$$ Q_\alpha(v) = \frac{1}{\alpha!} [[\ldots[Q, g_{\alpha_1}] \ldots], g_{\alpha_\ell}] = \frac{1}{\alpha!} D^{(\ell)}Q(v)(g_{\alpha_1}, \ldots, g_{\alpha_\ell}) , \tag{3.5} $$

where $\alpha! = \alpha_1! \cdots \alpha_\ell!$. (By convention, we set $Q_\emptyset = Q$ and $Q_\alpha = 0$ if $|\alpha| > m$.)

We emphasize that multi-indices are unordered collections of $\{1, \ldots, d\}$ where
repeated elements are allowed. As such, the union of two multi-indices is a well-defined
operation, as is the partial ordering given by inclusion.^5
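
The expansion (3.4) can be checked by hand in the simplest nontrivial case. The sketch below takes $m = 2$, $d = 2$ and $Q(u) = B(u, u)$ for a symmetric bilinear map $B$ (a hypothetical name used only here). It also assumes that $\alpha!$ is read as the product of the factorials of the multiplicities with which each index occurs in $\alpha$, which is the reading consistent with the counting-function description of multi-indices.

```latex
\begin{align*}
Q(v + g_1 w_1 + g_2 w_2)
 &= B(v,v) + 2B(v,g_1)\,w_1 + 2B(v,g_2)\,w_2 \\
 &\quad + B(g_1,g_1)\,w_1^2 + 2B(g_1,g_2)\,w_1 w_2 + B(g_2,g_2)\,w_2^2 .
\end{align*}
% Comparison with Q_\alpha = \frac{1}{\alpha!} D^{(\ell)}Q(v)(g_{\alpha_1},\dots,g_{\alpha_\ell}),
% using DQ(v)g = 2B(v,g) and D^{(2)}Q(v)(g,h) = 2B(g,h):
%   \alpha = (1):    \alpha! = 1,  Q_\alpha = 2B(v,g_1)
%   \alpha = (1,2):  \alpha! = 1,  Q_\alpha = 2B(g_1,g_2)
%   \alpha = (1,1):  \alpha! = 2,  Q_\alpha = B(g_1,g_1)
```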

5 To be precise, one could define a multi-index as a "counting function" $\alpha\colon \{1, \ldots, d\} \to \mathbf{N}$ with $\alpha(k)$
3.2 A priori bounds on the solution

This section is devoted to the proof that Assumption A.1 is sufficient to obtain not only
unique solutions to (3.1) (possibly up to some explosion time), but to obtain further
regularity properties for both the solution and its derivative with respect to the initial
condition. We do not claim that the material presented in this section is new, but while
similar frameworks can be found in [DPZ92, Fla95], the framework presented here
does not seem to appear in this form in the literature. Since the proofs are rather
straightforward, we choose to present them for the sake of completeness, although in a
rather condensed form.

We first start with a local existence and uniqueness result for the solutions to (3.1):

Proposition 3.4 For every initial condition $u_0 \in \mathcal{H}$, there exists a stopping time $\tau > 0$
such that (3.1) has a unique mild solution $u$ up to time $\tau$, that is to say $u$ almost surely
satisfies

$$ u_t = e^{-Lt}u_0 + \int_0^t e^{-L(t-s)} N(u_s)\,ds + \int_0^t e^{-L(t-s)} G\,dW(s) , \tag{3.6} $$

for all stopping times $t$ with $t \le \tau$.

For notational convenience, we denote by $W_L(s, t) = \int_s^t e^{-L(t-r)} G\,dW(r)$ the
"stochastic convolution." Since we assumed that $g_k \in \mathcal{H}_{\gamma_*+1}$, it is possible to obtain
bounds on all exponential moments of $\sup_{0 \le s < t \le T} \|W_L(s, t)\|_\gamma$ for every $T > 0$ and
every $\gamma \le \gamma_* + 1$.

Proof. Given a function $\xi\colon \mathbf{R}_+ \to \mathcal{H}$ and a time $T > 0$, define a map $\Phi_{T,\xi}\colon \mathcal{H} \times
C([0, T], \mathcal{H}) \to C([0, T], \mathcal{H})$ (endowed with the supremum norm) by

$$ \bigl(\Phi_{T,\xi}(u_0, u)\bigr)_t = e^{-Lt}u_0 + \xi(t) + \int_0^t e^{-L(t-s)} N(u_s)\,ds . \tag{3.7} $$

Since $N \in \mathrm{Poly}(\mathcal{H}, \mathcal{H}_{-a})$ by setting $\gamma = -a$ in Assumption A.1.2, and suppressing
the dependence on $u_0$, there exists a positive constant $C$ such that

$$ \|\Phi_{T,\xi}(u) - \Phi_{T,\xi}(\tilde u)\| \le \sup_{t \in [0,T]} C \int_0^t (t-s)^{-a} \|u_s - \tilde u_s\| (1 + \|u_s\| + \|\tilde u_s\|)^{n-1}\,ds \le C \|u - \tilde u\| (1 + \|u\| + \|\tilde u\|)^{n-1}\, T^{1-a} . $$

Recall that $n$ is the degree of the polynomial nonlinearity $N$. It follows that, for every
$\xi$, there exist $T > 0$ and $R > 0$ such that $\Phi_{T,\xi}(u_0, \cdot)$ is a contraction in the ball
of radius $R$ around $e^{-Lt}u_0 + \xi(t)$. Setting $\xi(t) = W_L(0, t)$, this yields existence and
uniqueness of the solution to (3.6) by the Banach fixed point theorem. The largest such
$T$ is a stopping time since it only depends on the norm of $u_0$ and on $\xi$ up to time $T$,
thus concluding the proof of the proposition. □
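
The factor $T^{1-a}$ in the Lipschitz estimate above, and the resulting smallness condition on $T$, can be spelled out as follows. The radius $R'$ bounding the supremum norms of $u$ and $\tilde u$ is a name introduced for this sketch only.

```latex
% The time singularity (t-s)^{-a} is integrable because a < 1:
\[
\int_0^t (t-s)^{-a}\,ds = \frac{t^{1-a}}{1-a} \le \frac{T^{1-a}}{1-a}, \qquad t \in [0,T].
\]
% Hence, if \sup_t \|u_t\| \le R' and \sup_t \|\tilde u_t\| \le R', then
\[
\|\Phi_{T,\xi}(u) - \Phi_{T,\xi}(\tilde u)\| \le C\,(1 + 2R')^{n-1}\,T^{1-a}\,\|u - \tilde u\| ,
\]
% which is a strict contraction as soon as C (1 + 2R')^{n-1} T^{1-a} < 1.
```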

The remainder of this section is devoted to obtaining further regularity properties 
of the solutions. 



counting the number of appearances of the index $k$. With this identification, the union of two multi-indices
corresponds to the sum of their counting functions, while $\alpha \subset \beta$ means that $\alpha(k) \le \beta(k)$ for every $k$.
Proposition 3.5 Fix $T > 0$. For every $\gamma \in [0, \gamma_* + 1)$ there exist exponents $p_\gamma \ge 1$
and $q_\gamma \ge 0$ and a constant $C$ such that

$$ \|u_t\|_\gamma \le C t^{-q_\gamma} \Bigl(1 + \sup_{s \in [\frac{t}{2}, t]} \|u_s\| + \sup_{\frac{t}{2} \le s < r \le t} \|W_L(s, r)\|_\gamma\Bigr)^{p_\gamma} \tag{3.8} $$

for all $t \in (0, T]$. In particular, if $\gamma = \sum_{j=0}^{k} \delta_j$ for some $k \in \mathbf{N}$ and $\delta_j \in (0, 1 - a)$,
then $q_\gamma \le \sum_{j=1}^{k} \delta_j n^{j-1}$.

Proof. The proof follows a standard "bootstrapping argument" on $\gamma$ in the following
way. The statement is obviously true for $\gamma = 0$ with $p_\gamma = 1$ and $q_\gamma = 0$. Assume that,
for some $\alpha = \alpha_0 \in [1/2, 1)$ and for some $\gamma = \gamma_0 \in [0, \gamma_* + a)$, we have the bound

$$ \|u_t\|_\gamma \le C t^{-q_\gamma} \Bigl(1 + \sup_{s \in [\alpha t, t]} \|u_s\| + \sup_{\alpha t \le s < r \le t} \|W_L(s, r)\|_\gamma\Bigr)^{p_\gamma} , \tag{3.9} $$

for all $t \in (0, T]$.

We will then argue that, for any arbitrary $\delta \in (0, 1 - a)$, the statement (3.9) also
holds for $\gamma = \gamma_0 + \delta$ (and therefore also for all intermediate values of $\gamma$) and $\alpha = \alpha_0^2$.
Since it is possible to go from $\gamma = 0$ to any value of $\gamma < \gamma_* + 1$ in a finite number of
steps (making sure that $\gamma < \gamma_* + a$ in every intermediate step) and since we are allowed
to choose $\alpha$ as close to 1 as we wish, the claim follows at once.

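Using the mild formulation (3.6), we have

$$ u_t = e^{-(1-\alpha)Lt} u_{\alpha t} + \int_{\alpha t}^{t} e^{-L(t-s)} N(u_s)\,ds + W_L(\alpha t, t) . $$

Since $\gamma \in [0, \gamma_* + a)$, one has $N \in \mathrm{Poly}(\mathcal{H}_\gamma, \mathcal{H}_{\gamma-a})$ by Assumption A.1.2. Hence, for
$t \in (0, T]$,

$$ \begin{aligned} \|u_t\|_{\gamma+\delta} &\le C t^{-\delta} \|u_{\alpha t}\|_\gamma + \|W_L(\alpha t, t)\|_\gamma + C \int_{\alpha t}^{t} (t-s)^{-(\delta+a)} (1 + \|u_s\|_\gamma)^n\,ds \\ &\le C (t^{-\delta} + t^{1-\delta-a}) \sup_{\alpha t \le s \le t} (1 + \|u_s\|_\gamma^n) + \|W_L(\alpha t, t)\|_\gamma \\ &\le C t^{-\delta} \sup_{\alpha t \le s \le t} (1 + \|u_s\|_\gamma^n) + \|W_L(\alpha t, t)\|_\gamma . \end{aligned} $$

Here, the constant $C$ depends on everything but $t$ and $u_0$. Using the induction hypothesis,
this yields the bound

$$ \|u_t\|_{\gamma+\delta} \le C t^{-\delta - n q_\gamma} \Bigl(1 + \sup_{s \in [\alpha^2 t, t]} \|u_s\| + \sup_{\alpha^2 t \le s < r \le t} \|W_L(s, r)\|_\gamma\Bigr)^{n p_\gamma} + \|W_L(\alpha t, t)\|_\gamma , $$

thus showing that (3.9) holds for $\gamma = \gamma_0 + \delta$ and $\alpha = \alpha_0^2$ with $p_{\gamma+\delta} = n p_\gamma$ and
$q_{\gamma+\delta} = \delta + n q_\gamma$. This concludes the proof of Proposition 3.5. □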
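
The two recursions $p_{\gamma+\delta} = n p_\gamma$ and $q_{\gamma+\delta} = \delta + n q_\gamma$ can be solved explicitly, which shows where a bound on $q_\gamma$ as a weighted sum of the increments $\delta_j$ comes from. The labelling of the increments as $\delta_1, \ldots, \delta_k$ in the order in which they are applied is an assumption of this sketch; a different labelling only permutes the powers of $n$.

```latex
% Start from p_0 = 1, q_0 = 0 and apply the increments \delta_1, \dots, \delta_k in turn.
\begin{align*}
p^{(k)} &= n\,p^{(k-1)} = n^{k}, \\
q^{(1)} &= \delta_1, \qquad
q^{(2)} = \delta_2 + n\,\delta_1, \qquad
q^{(k)} = \delta_k + n\,q^{(k-1)} = \sum_{j=1}^{k} \delta_j\, n^{\,k-j} .
\end{align*}
% Relabelling the increments in reverse order turns the last sum into \sum_{j=1}^{k} \delta_j n^{j-1}.
```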

3.3 Linearization and its adjoint 

In this section, we study how the solutions to (3.1) depend on their initial conditions.
Since the map from (3.7) used to construct the solutions to (3.1) is Fréchet differentiable
(it is actually infinitely differentiable) and since it is a contraction for sufficiently
small values of $t$, we can apply the implicit function theorem (see for example [RR04]
for a Banach space version) to deduce that for every realisation of the driving noise,
the map $u_s \mapsto u_t$ is Fréchet differentiable, provided that $t > s$ is sufficiently close to
$s$.

Iterating this argument, one sees that, for any $s \le t < \tau$, the map $u_s \mapsto u_t$ given
by the solutions to (3.1) is Fréchet differentiable in $\mathcal{H}$. Inspecting the expression for
the derivative given by the implicit function theorem, we conclude that the derivative
$J_{s,t}\varphi$ in the direction $\varphi \in \mathcal{H}$ satisfies the following random linear equation in its mild
formulation:

$$ \partial_t J_{s,t}\varphi = -L J_{s,t}\varphi + DN(u_t) J_{s,t}\varphi , \qquad J_{s,s}\varphi = \varphi . \tag{3.10} $$
Note that, by the properties of monomials, it follows from Assumption A.1.2 that

$$ \|DN(u)v\|_\gamma \le C (1 + \|u\|_{\gamma+a})^{n-1} \|v\|_{\gamma+a} , $$

for every $\gamma \in [-a, \gamma_*)$. A fixed point argument similar to the one in Proposition 3.4
shows that the solution to (3.10) is unique, but note that it does not allow us to obtain
bounds on its moments. We only have that for any $T$ smaller than the explosion time
of the solutions to (3.1), there exists a (random) constant $C$ such that

$$ \sup_{0 \le s < t < T}\ \sup_{\|\varphi\| \le 1} \|J_{s,t}\varphi\| \le C . \tag{3.11} $$

The constant $C$ depends exponentially on the size of the solution $u$ in the interval
$[0, T]$. However, if we obtain better control on $J_{s,t}$ by some means, we can then use
the following bootstrapping argument:

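Proposition 3.6 For every $\gamma < \gamma_* + 1$, there exist exponents $\bar p_\gamma, \bar q_\gamma \ge 0$, and constants
$C > 0$ and $\gamma_0 < \gamma$ such that we have the bound

$$ \|J_{t,t+s}\varphi\|_\gamma \le C s^{-\bar p_\gamma} \sup_{r \in [\frac{s}{2}, s]} (1 + \|u_{t+r}\|_{\gamma_0})^{\bar q_\gamma} \|J_{t,t+r}\varphi\| , \tag{3.12} $$
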
for every $\varphi \in \mathcal{H}$ and every $t, s > 0$. If $\gamma < 1 - a$, then one can choose $\gamma_0 = \bar p_\gamma = 0$
and $\bar q_\gamma = n - 1$.

Since an almost identical argument will be used in the proof of Proposition 3.8
below, we refer the reader there for details. We chose to present that proof instead of
this one because the presence of an adjoint causes slight additional complications.

For $s \le t$, let us define operators $K_{s,t}$ via the solution to the (random) PDE

$$ \partial_s K_{s,t}\varphi = L K_{s,t}\varphi - DN^*(u_s) K_{s,t}\varphi , \qquad K_{t,t}\varphi = \varphi , \quad \varphi \in \mathcal{H} . \tag{3.13} $$

Note that this equation runs backwards in time and is random through the solution
$u_t$ of (3.1). Here, $DN^*(u)$ denotes the adjoint in $\mathcal{H}$ of the operator $DN(u)$ defined
earlier. Fixing the terminal time $t$ and setting $\varphi_s = K_{t-s,t}\varphi$, we obtain a more usual
representation for $\varphi_s$:

$$ \partial_s \varphi_s = -L\varphi_s + DN^*(u_{t-s})\varphi_s . \tag{3.14} $$

The remainder of this subsection will be devoted to obtaining regularity bounds on the
solutions to (3.13) and to the proof that $K_{s,t}$ is actually the adjoint of $J_{s,t}$. We start
by showing that, for $\gamma$ sufficiently close to (but less than) $\gamma_* + 1$, (3.13) has a unique
solution for every path $u \in C(\mathbf{R}, \mathcal{H}_\gamma)$ and $\varphi \in \mathcal{H}$.
Proposition 3.7 There exists $\gamma < \gamma_* + 1$ such that equation (3.13) has a unique solution
for every $s < t$ and every $u \in C(\mathbf{R}, \mathcal{H}_\gamma)$. Furthermore, $K_{s,t}$ depends only on $u_r$
for $r \in [s, t]$.

Proof. As in Proposition 3.4, we define a map $\Phi_{T,u}\colon \mathcal{H} \times C([0, T], \mathcal{H}) \to C([0, T], \mathcal{H})$
by

$$ \bigl(\Phi_{T,u}(\varphi_0, \varphi)\bigr)_t = e^{-Lt}\varphi_0 + \int_0^t e^{-L(t-s)} \bigl(DN^*(u_s)\bigr)\varphi_s\,ds . $$

It follows from Assumption A.1.3 with $\beta = -a$ that there exists $\gamma < \gamma_* + 1$ such that
$DN^*(u)\colon \mathcal{H} \to \mathcal{H}_{-a}$ is a bounded linear operator for every $u \in \mathcal{H}_\gamma$. Proceeding as in
the proof of Proposition 3.4, we see that $\Phi$ is a contraction for sufficiently small $T$. □

Similarly to before, we can use a bootstrapping argument to show that $K_{s,t}\varphi$ actually
has more regularity than stated in Proposition 3.7.

Proposition 3.8 For every $\beta < \beta_* + 1$, there exists $\gamma < \gamma_* + 1$, exponents $\bar p_\beta, \bar q_\beta > 0$,
and a constant $C$ such that

$$ \|K_{t-s,t}\varphi\|_\beta \le C s^{-\bar p_\beta} \sup_{r \in [\frac{s}{2}, s]} (1 + \|u_{t-r}\|_\gamma)^{\bar q_\beta} \|K_{t-r,t}\varphi\| , \tag{3.15} $$

for every $\varphi \in \mathcal{H}$, every $t, s > 0$, and every $u \in C(\mathbf{R}, \mathcal{H}_\gamma)$.

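Proof. Fix $\beta < \beta_* + a$ and $\delta \in (0, 1 - a)$ and assume that the bound (3.15) holds for
$\|K_{s,t}\varphi\|_\beta$. Since we run $s$ "backwards in time" from $s = t$, we consider again $t$ as
fixed and use the notation $\varphi_s = K_{t-s,t}\varphi$. We then have, for arbitrary $\alpha \in (0, 1)$,

$$ \|\varphi_s\|_{\beta+\delta} \le C s^{-\delta} \|\varphi_{\alpha s}\|_\beta + C \int_{\alpha s}^{s} (s-r)^{-(\delta+a)} \|DN^*(u_{t-r})\varphi_r\|_{\beta-a}\,dr , $$

provided that $\gamma$ is sufficiently close to $\gamma_* + 1$ such that $DN^*\colon \mathcal{H}_\gamma \to \mathcal{L}(\mathcal{H}_\beta, \mathcal{H}_{\beta-a})$ by
Assumption A.1.3. Furthermore, the operator norm of $DN^*(v)$ is bounded by $C(1 +
\|v\|_\gamma)^{n-1}$, yielding

$$ \begin{aligned} \|\varphi_s\|_{\beta+\delta} &\le C s^{-\delta} \|\varphi_{\alpha s}\|_\beta + C s^{-(\delta+a)} \sup_{r \in [\alpha s, s]} (1 + \|u_{t-r}\|_\gamma)^{n-1} \|\varphi_r\|_\beta \\ &\le C s^{-(\delta+a)} \sup_{r \in [\alpha s, s]} (1 + \|u_{t-r}\|_\gamma)^{n-1} \|\varphi_r\|_\beta . \end{aligned} $$

Iterating these bounds as in Proposition 3.5 concludes the proof. □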

The following lemma appears also in [MP06, BM07]. It plays a central role in establishing
the representation of the Malliavin matrix given in (4.11), on which this article
as well as [MP06, BM07] rely heavily.

Proposition 3.9 For every $0 \le s < t$, $K_{s,t}$ is the adjoint of $J_{s,t}$ in $\mathcal{H}$, that is $K_{s,t} =
J_{s,t}^*$.

Proof. Fixing $0 \le s < t$ and $\varphi, \psi \in \mathcal{H}_\infty$, we claim that the expression

$$ \langle J_{s,r}\varphi, K_{r,t}\psi\rangle , \tag{3.16} $$

is independent of $r \in [s, t]$. Evaluating (3.16) at both $r = s$ and $r = t$ then concludes
the proof.

We now prove that (3.16) is independent of $r$ as claimed. It follows from (3.13)
and Proposition 3.5 that, with probability one, the map $r \mapsto K_{r,t}\psi$ is continuous with
values in $\mathcal{H}_{\beta+1}$ and differentiable with values in $\mathcal{H}_\beta$, provided that $\beta < \beta_*$. Similarly,
the map $r \mapsto J_{s,r}\varphi$ is continuous with values in $\mathcal{H}_{\gamma+1}$ and differentiable with values
in $\mathcal{H}_\gamma$, provided that $\gamma < \gamma_*$. Since $\gamma_* + \beta_* > -1$ by assumption, it thus follows that
(3.16) is differentiable in $r$ for $r \in (s, t)$ with

$$ \partial_r \langle J_{s,r}\varphi, K_{r,t}\psi\rangle = \bigl\langle (-L + DN(u_r)) J_{s,r}\varphi, K_{r,t}\psi\bigr\rangle + \bigl\langle J_{s,r}\varphi, (L - DN^*(u_r)) K_{r,t}\psi\bigr\rangle = 0 . $$

Since furthermore both $r \mapsto K_{r,t}\psi$ and $r \mapsto J_{s,r}\varphi$ are continuous in $r$ on the closed
interval, the proof is complete. See for example [DL92, p. 477] for more details. □

4 Malliavin calculus 

In this section, we show that the solution to the SPDE (3.1) has a Malliavin derivative
and we give an expression for it. Actually, since we are dealing with additive noise, we
show the stronger result that the solution is Fréchet differentiable with respect to the
driving noise. In this section, we will make the standing assumption that the explosion
time $\tau$ from Proposition 3.4 is infinite.

4.1 Malliavin derivative 

In light of Proposition 3.4, for fixed initial condition $u_0 \in \mathcal{H}$ there exists an "Itô map"
$\Phi_t^{u_0}\colon C([0, t], \mathbf{R}^d) \to \mathcal{H}$ with $u_t = \Phi_t^{u_0}(W)$. We have:

Proposition 4.1 For every $t > 0$ and every $u \in \mathcal{H}$, the map $\Phi_t^u$ is Fréchet differentiable
and its Fréchet derivative $D\Phi_t^u v$ in the direction $v \in C(\mathbf{R}_+, \mathbf{R}^d)$ satisfies the
equation

$$ dD\Phi_t^u v = -L D\Phi_t^u v\,dt + DN(\Phi_t^u) D\Phi_t^u v\,dt + G\,dv(t) \tag{4.1} $$

in the mild sense.

Remark 4.2 Note that (4.1) has a unique $\mathcal{H}$-valued mild solution for every continuous
function $v$ because it follows from our assumptions that $Gv \in C(\mathbf{R}_+, \mathcal{H}_\gamma)$ for some
$\gamma > 0$ and therefore $\int_0^t e^{-L(t-s)} G\,dv(s) = Gv(t) - e^{-Lt}Gv(0) - \int_0^t L e^{-L(t-s)} Gv(s)\,ds$
is a continuous $\mathcal{H}$-valued process.
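
The identity in Remark 4.2 is an integration by parts in the time variable $s$. A formal version of the computation, written as if $v$ were differentiable (an assumption made only for this sketch; the right-hand side then serves as the definition for merely continuous $v$), reads:

```latex
% Use \partial_s e^{-L(t-s)} = L e^{-L(t-s)}.
\begin{align*}
\int_0^t e^{-L(t-s)} G\,dv(s)
 &= \Bigl[ e^{-L(t-s)} G v(s) \Bigr]_{s=0}^{s=t}
    - \int_0^t \partial_s\bigl(e^{-L(t-s)}\bigr) G v(s)\,ds \\
 &= G v(t) - e^{-Lt} G v(0) - \int_0^t L e^{-L(t-s)} G v(s)\,ds .
\end{align*}
```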

Proof of Proposition 4.1. The proof works in exactly the same way as the arguments
presented in Section 3.3: it follows from Remark 4.2 that for any given $u_0 \in \mathcal{H}$ and
$t > 0$, the map

$$ (W, u) \mapsto e^{-Lt}u_0 + \int_0^t e^{-L(t-s)} N(u(s))\,ds + \int_0^t e^{-L(t-s)} G\,dW(s) $$

is Fréchet differentiable in $C([0, t], \mathbf{R}^d) \times C([0, t], \mathcal{H})$. Furthermore, for $t$ sufficiently
small (depending on $u$ and $W$), it satisfies the assumptions of the implicit function
theorem, so that the claim follows in this case. The claim for arbitrary values of $t$
follows by iterating the statement. □
As a consequence, it follows from Duhamel's formula and the fact that $J_{s,t}$ is the
unique solution to (3.10) that

Corollary 4.3 If $v$ is absolutely continuous and of bounded variation, then

$$ D\Phi_t^u v = \int_0^t J_{s,t} G\,dv(s) , \tag{4.2} $$

where the integral is to be understood as a Riemann-Stieltjes integral and the Jacobian
$J_{s,t}$ is as in (3.10).


In particular, (4.2) holds for every $v$ in the Cameron-Martin space

$$ CM = \{v : \partial_t v \in L^2([0, \infty), \mathbf{R}^d),\ v(0) = 0\} , $$

which is a Hilbert space endowed with the norm $\|v\|^2_{CM} = \int_0^\infty |\partial_t v(t)|^2_{\mathbf{R}^d}\,dt = |\!|\!|\partial_t v|\!|\!|^2$.
Obviously, $CM$ is isometric to $CM' = L^2([0, \infty), \mathbf{R}^d)$, so we will in the sequel use the
notation

$$ \mathcal{D}_h\Phi_t^u = D\Phi_t^u v = \int_0^t J_{s,t} G\,dv(s) = \int_0^t J_{s,t} G h(s)\,ds , \qquad \text{if } \partial_t v = h . \tag{4.3} $$

The representation (4.2) is still valid for arbitrary stochastic processes $h$ such that $h \in
CM'$ almost surely.

Since $G\colon \mathbf{R}^d \to \mathcal{H}_{\gamma_*+1}$ is a bounded operator whose norm we denote $\|G\|$, we
obtain the bound

$$ \|\mathcal{D}_h\Phi_t^u\| \le \|G\| \int_0^t \|J_{s,t}\|\,|h(s)|\,ds \le C \|J_{\cdot,t}\|_{L^2(0,t,\mathcal{H})}\,|\!|\!|h|\!|\!| , $$

valid for every $h \in CM'$. In particular, by Riesz's representation theorem, this shows
that there exists a (random) element $\mathcal{D}\Phi_t^u$ of $CM' \otimes \mathcal{H}$ such that

$$ \mathcal{D}_h\Phi_t^u = \langle \mathcal{D}\Phi_t^u, h\rangle_{CM'} = \int_0^\infty \mathcal{D}_s\Phi_t^u\, h(s)\,ds , \tag{4.4} $$

for every $h \in CM'$. This abuse of notation is partially justified by the fact that, at least
formally, $\mathcal{D}_s\Phi_t^u = \mathcal{D}_h\Phi_t^u$ with $h(r) = \delta(s - r)$. In our particular case, it follows from
(4.2) that one has

$$ \mathcal{D}_s\Phi_t^u = J_{s,t} G \in \mathbf{R}^d \otimes \mathcal{H} , \qquad s < t , $$

and $\mathcal{D}_s\Phi_t^u = 0$ for $s > t$. With this notation, the identity (4.2) can be rewritten as
$\mathcal{D}_h u_t = \int_0^t \mathcal{D}_s u_t\, h(s)\,ds$. It follows from the theory of Malliavin calculus, see for
example [Mal97, Nua95], that, for every $t > 0$ (and for any Hilbert space $\mathcal{H}$), the linear
map $\Phi \mapsto \mathcal{D}\Phi$ described above yields a closable linear operator from $L^2(\Omega, \mathbf{R}) \otimes \mathcal{H}$ to
$L^2_{ad}(\Omega, \mathcal{F}_t, CM') \otimes \mathcal{H}$. Here, $\mathcal{F}_t$ is the $\sigma$-algebra generated by the increments of $W$ up
to time $t$ and $L^2_{ad}$ denotes the space of $L^2$ functions adapted to the filtration $\{\mathcal{F}_t\}$.

The operator $\mathcal{D}$ simply acts as the identity on the factor $\mathcal{H}$, so that we really interpret
it as an operator from $L^2(\Omega, \mathbf{R})$ to $L^2(\Omega, CM')$. The operator $\mathcal{D}$ is called the "Malliavin
derivative."

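We define a family of random linear operators $\mathcal{A}_t\colon CM' \to \mathcal{H}$ (depending also on
the initial condition $u_0 \in \mathcal{H}$ for (3.1)) by $h \mapsto \langle \mathcal{D}\Phi_t^u, h\rangle$. It follows from (4.3) that
their adjoints $\mathcal{A}_t^*\colon \mathcal{H} \to CM'$ are given for $\xi \in \mathcal{H}$ by

$$ (\mathcal{A}_t^*\xi)(s) = \begin{cases} G^* J_{s,t}^* \xi = G^* K_{s,t}\xi & \text{for } s \le t , \\ 0 & \text{for } s > t . \end{cases} \tag{4.5} $$

Similarly, we define $\mathcal{A}_{s,t}\colon CM' \to \mathcal{H}$ by $\mathcal{A}_{s,t} h = \mathcal{A}_t(h \mathbf{1}_{[s,t]}) = \langle \mathcal{D}u_t, h \mathbf{1}_{[s,t]}\rangle =
\int_s^t J_{r,t} G h_r\,dr$. Observe that $\mathcal{A}_{s,t}^*\colon \mathcal{H} \to CM'$ is given for $\xi \in \mathcal{H}$ by $(\mathcal{A}_{s,t}^*\xi)(r) =
G^* J_{r,t}^* \xi = G^* K_{r,t}\xi$ for $r \in [s, t]$ and zero otherwise.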

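Recall that the Skorokhod integral $h \mapsto \int_0^t h(s) \cdot dW(s) = \mathcal{D}^* h$ is defined as the
adjoint of the Malliavin derivative operator (or rather of the part acting on $L^2(\Omega, \mathcal{F}_t, \mathbf{R})$
and not on $\mathcal{H}$). In other words, one has the following identity between elements of $\mathcal{H}$:

$$ \mathbf{E}\,\mathcal{D}_h\Phi_t^u = \mathbf{E}\langle \mathcal{D}\Phi_t^u, h\rangle = \mathbf{E}\Bigl(\Phi_t^u \int_0^t h(s) \cdot dW(s)\Bigr) , \tag{4.6} $$

for every $h \in L^2(\Omega, CM')$ belonging to the domain of $\mathcal{D}^*$.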

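It is well-established [Nua95, Ch. 1.3] that the Skorokhod integral has the following
two important properties:

1. Every adapted process $h$ with $\mathbf{E}|\!|\!|h|\!|\!|^2 < \infty$ belongs to the domain of $\mathcal{D}^*$ and the
Skorokhod integral then coincides with the usual Itô integral.

2. For non-adapted processes $h$, if $h(s)$ belongs to the domain of $\mathcal{D}$ for almost
every $s$ and is such that

$$ \mathbf{E} \int_0^t \int_0^t |\mathcal{D}_s h(r)|^2_{\mathbf{R}^{d \times d}}\,ds\,dr = \mathbf{E} \int_0^t |\!|\!|\mathcal{D}_s h|\!|\!|^2\,ds < \infty , $$

then one has the following modification of the Itô isometry:

$$ \mathbf{E}\Bigl(\int_0^t h(s) \cdot dW(s)\Bigr)^2 = \mathbf{E} \int_0^t |h(s)|^2_{\mathbf{R}^d}\,ds + \mathbf{E} \int_0^t \int_0^t \operatorname{tr}\bigl(\mathcal{D}_s h(r)\, \mathcal{D}_r h(s)\bigr)\,ds\,dr . \tag{4.7} $$

Note here that since $h(s) \in \mathbf{R}^d$, we interpret $\mathcal{D}_r h(s)$ as a $d \times d$ matrix.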
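
A quick consistency check on (4.7), offered here as a sketch: for an adapted process $h$ the correction term vanishes and (4.7) reduces to the usual Itô isometry, in agreement with the first property. The argument uses the standard fact that the Malliavin derivative $\mathcal{D}_s$ of an $\mathcal{F}_r$-measurable random variable vanishes for $s > r$.

```latex
% h adapted  =>  \mathcal{D}_s h(r) = 0 whenever s > r.
\[
\operatorname{tr}\bigl(\mathcal{D}_s h(r)\,\mathcal{D}_r h(s)\bigr) =
\begin{cases}
0 & \text{if } s > r \quad (\text{since } \mathcal{D}_s h(r) = 0), \\
0 & \text{if } s < r \quad (\text{since } \mathcal{D}_r h(s) = 0),
\end{cases}
\]
% so the double integral in (4.7) is supported on the diagonal \{s = r\}, a set of measure zero, and
\[
\mathbf{E}\Bigl(\int_0^t h(s)\cdot dW(s)\Bigr)^2 = \mathbf{E}\int_0^t |h(s)|^2_{\mathbf{R}^d}\,ds .
\]
```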
4.2 Malliavin derivative of the Jacobian 

By iterating the implicit function theorem, we can see that the map that associates a
given realisation of the Wiener process $W$ to the Jacobian $J_{s,t}\varphi$ is also Fréchet (and
therefore Malliavin) differentiable. Its Malliavin derivative $\mathcal{D}_h J_{s,t}\varphi$ in the direction
$h \in CM'$ is given by the unique solution to

$$ \partial_t \mathcal{D}_h J_{s,t}\varphi = -L \mathcal{D}_h J_{s,t}\varphi + DN(u_t) \mathcal{D}_h J_{s,t}\varphi + D^2N(u_t)(\mathcal{D}_h u_t, J_{s,t}\varphi) , $$

endowed with the initial condition $\mathcal{D}_h J_{s,s}\varphi = 0$. Just as the Malliavin derivative of the
solution was related to its derivative with respect to the initial condition, the Malliavin
derivative of $J_{s,t}$ can be related to the second derivative of the flow with respect to the
initial condition in the following way. Denoting by $J^{(2)}_{s,t}(\varphi, \psi)$ the second derivative of
$u_t$ with respect to $u_0$ in the directions $\varphi$ and $\psi$, we see that $J^{(2)}_{s,t}(\varphi, \psi)$ is the solution to

$$ \partial_t J^{(2)}_{s,t}(\varphi, \psi) = -L J^{(2)}_{s,t}(\varphi, \psi) + DN(u_t) J^{(2)}_{s,t}(\varphi, \psi) + D^2N(u_t)(J_{s,t}\psi, J_{s,t}\varphi) , $$

endowed with the initial condition $J^{(2)}_{s,s}(\varphi, \psi) = 0$.

Assuming that $h$ vanishes outside of the interval $[s, t]$ and using the identities
$J_{r,t} J_{s,r} = J_{s,t}$ and $\mathcal{D}_h u_t = \int_s^t J_{r,t} G h(r)\,dr$, we can check by differentiating both
sides and identifying terms that one has the identity

$$ \mathcal{D}_h J_{s,t}\varphi = \int_s^t J^{(2)}_{r,t}(G h(r), J_{s,r}\varphi)\,dr , \tag{4.8} $$

which we can rewrite as

$$ \mathcal{D}_r J_{s,t}\varphi = J^{(2)}_{r,t}(G, J_{s,r}\varphi) . \tag{4.9} $$

This identity is going to be used in Section 5.
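
The verification of (4.8) "by differentiating both sides and identifying terms" can be sketched as follows. The shorthand $X_t$ for the right-hand side of (4.8) is introduced only for this illustration, and the manipulations are formal.

```latex
% Set X_t = \int_s^t J^{(2)}_{r,t}(Gh(r), J_{s,r}\varphi)\,dr.
% Since J^{(2)}_{t,t} = 0, differentiating the upper limit of integration contributes nothing.
\begin{align*}
\partial_t X_t
 &= \int_s^t \partial_t J^{(2)}_{r,t}\bigl(Gh(r), J_{s,r}\varphi\bigr)\,dr \\
 &= \bigl(-L + DN(u_t)\bigr) X_t
    + \int_s^t D^2N(u_t)\bigl(J_{r,t}Gh(r),\, J_{r,t}J_{s,r}\varphi\bigr)\,dr \\
 &= \bigl(-L + DN(u_t)\bigr) X_t + D^2N(u_t)\bigl(\mathcal{D}_h u_t,\, J_{s,t}\varphi\bigr) ,
\end{align*}
% using J_{r,t}J_{s,r} = J_{s,t}, bilinearity of D^2N(u_t), and \mathcal{D}_h u_t = \int_s^t J_{r,t}Gh(r)\,dr.
% Together with X_s = 0, this is the equation satisfied by \mathcal{D}_h J_{s,t}\varphi.
```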
4.3 Malliavin covariance matrix 

We now define and explore the properties of the Malliavin covariance matrix, whose 
non-degeneracy is central to our constructions. 

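Definition 4.4 Assume that the explosion time $\tau = \infty$ for every initial condition in $\mathcal{H}$.
Then, for any $t > 0$, the Malliavin matrix $\mathcal{M}_t\colon \mathcal{H} \to \mathcal{H}$ is the linear operator defined
by

$$ \mathcal{M}_t\varphi = \sum_{k=1}^{d} \int_0^t \langle J_{s,t} g_k, \varphi\rangle\, J_{s,t} g_k\,ds . \tag{4.10} $$

Observe that this is equivalent to

$$ \mathcal{M}_t = \mathcal{A}_t \mathcal{A}_t^* = \int_0^t J_{s,t} G G^* J_{s,t}^*\,ds = \int_0^t J_{s,t} G G^* K_{s,t}\,ds , $$

thus motivating the definition $\mathcal{M}_{s,t} = \mathcal{A}_{s,t}\mathcal{A}_{s,t}^*$ for arbitrary time intervals $0 \le s < t$.
From this it is clear that $\mathcal{M}_{s,t}$ is a symmetric positive operator with

$$ \langle \mathcal{M}_t\varphi, \varphi\rangle = \sum_{k=1}^{d} \int_0^t \langle J_{s,t} g_k, \varphi\rangle^2\,ds = \sum_{k=1}^{d} \int_0^t \langle g_k, K_{s,t}\varphi\rangle^2\,ds \tag{4.11} $$

for all $\varphi \in \mathcal{H}$.

The meaning of the Malliavin covariance matrix defined in (4.10) is rather intuitive,
especially for the diagonal elements $\langle \mathcal{M}_t\varphi, \varphi\rangle$. If $\langle \mathcal{M}_t\varphi, \varphi\rangle > 0$ then there exists
some variation in the Wiener process on the time interval $[0, t]$ which creates a variation
of $u_t$ in the direction $\varphi$.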
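It is also useful to understand on what spaces the operator norm of $\mathcal{M}_t$ is bounded.
As a simple consequence of Proposition 3.6 we have:

Proposition 4.5 For every $T > 0$ and $\gamma \in [0, 1 - a)$, $\mathcal{M}_T$ can be extended to a
bounded (random) linear operator from $\mathcal{H}_{-\gamma}$ to $\mathcal{H}_\gamma$ with probability one. In particular,
$\mathcal{M}_T$ is almost surely a positive, compact, self-adjoint linear operator on $\mathcal{H}$ such that
the bound

$$ \sup_{\substack{\varphi, \psi \in \mathcal{H}_{-\gamma} \\ \|\varphi\|_{-\gamma} = \|\psi\|_{-\gamma} = 1}} \langle \mathcal{M}_T\varphi, \psi\rangle \le T C \sup_{0 \le s < t \le T}\ \sup_{k}\ (1 + \|u_t\|)^{2n-2} \|J_{s,t} g_k\|^2 $$

holds with some deterministic constant $C$.

Proof. From (4.10) we have that

$$ \sup_{\substack{\varphi, \psi \in \mathcal{H}_{-\gamma} \\ \|\varphi\|_{-\gamma} = \|\psi\|_{-\gamma} = 1}} \langle \mathcal{M}_t\varphi, \psi\rangle \le \sum_{k=1}^{d} \int_0^t \|J_{s,t} g_k\|_\gamma^2\,ds . $$

Since the $g_k$ belong to $\mathcal{H}$ by assumption, the required bound now follows from Proposition 3.6. □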
5 Smoothing in infinite dimensions

We now turn our study of (3.3) to one of the principal goals of this article. As in the
preceding section, we assume that all solutions are global in time and that the standing
assumptions in Assumption A.1 continue to hold. The aim of this section is to prove "smoothing"
estimates for the corresponding Markov semigroup $\mathcal{P}_t$ whose action on bounded test
functions $\varphi\colon \mathcal{H} \to \mathbf{R}$ is defined by

$$ \mathcal{P}_t\varphi(v) = \mathbf{E}_v \varphi(u_t) . $$

Here, the subscript in the expectation refers to the initial condition for the solution $u_t$
to (3.3). We begin with a brief discussion of the type of estimates we will prove and the
ideas used in their proof. A long discussion on this can be found in [HM06], in which
a number of the tools of this paper were developed, or in [Mat08], which has a longer
motivating discussion.

Recall also that the Malliavin covariance matrix $\mathcal{M}_t\colon \mathcal{H} \to \mathcal{H}$ for the solution
to (3.3) was defined in (4.10) as $\mathcal{M}_t = \mathcal{A}_t\mathcal{A}_t^*$ and that it is a random, compact
self-adjoint operator on $\mathcal{H}$. Since $\mathcal{H}$ is assumed to be infinite-dimensional, $\mathcal{M}_t$ will in
general not be invertible. However, as discussed in the introduction, we will only need
it to be "approximately invertible" on some subspace, paired with an assumption that the
dynamics is contractive off this subspace. The assumption of "approximate invertibility"
on some subspace is formulated below in Assumption B.1 and the contractivity
assumption is formulated in Assumption B.4. These are the two fundamental structural
assumptions needed for this theory. In between the statements of these two assumptions,
two other assumptions are given. They are more technical in nature and ensure that we
can control various quantities.

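Assumption B.1 (Malliavin matrix) There exists a function $U : \mathcal{H} \to [1,\infty)$ and
an orthogonal projection operator $\Pi : \mathcal{H} \to \mathcal{H}$ such that, for every
$\alpha > 0$, the bound

$$ \mathbf{P}\Big( \inf_{\|\Pi\varphi\| \ge \alpha\|\varphi\|} \frac{\langle \varphi, \mathcal{M}_1\varphi\rangle}{\|\varphi\|^2} \le \varepsilon \Big) \le C(\alpha,p)\,U^p(u_0)\,\varepsilon^p \tag{5.1} $$

holds for every $\varepsilon \le 1$, $p \ge 1$ and $u_0 \in \mathcal{H}$. Furthermore, for some
$\bar q \ge 2$, there exists a constant $C_U$ so that for every initial condition
$u_0 \in \mathcal{H}$, the bound

$$ \mathbf{E}\,U^{\bar q}(u_n) \le C_U^{\bar q}\,U^{\bar q}(u_0) $$

holds uniformly in $n \ge 0$.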

We are also going to assume in this section that the solutions to (3.1) have the
following Lyapunov-type structure, which is stronger than Assumption C.1 used in the
previous section:

Assumption B.2 (Lyapunov structure) Equation (3.1) has global solutions for every
initial condition. Furthermore, there exists a function $V : \mathcal{H} \to \mathbf{R}_+$
such that there exist constants $C_L > 0$ and $\eta' \in [0,1)$ such that

$$ \mathbf{E}\exp(V(u_1)) \le \exp(\eta' V(u_0) + C_L) . \tag{5.2} $$

Assumption B.3 (Jacobian) The Jacobian $J_{s,t}$ and the second derivative $J^{(2)}_{s,t}$
satisfy the bounds

$$ \mathbf{E}\|J_{s,t}\|^{\bar p} \le \exp(\bar p\,\eta V(u_0) + \bar p\,C_J) , $$
$$ \mathbf{E}\|J^{(2)}_{s,t}\|^{\bar p} \le \exp(\bar p\,\eta V(u_0) + \bar p\,C_J^{(2)}) , $$

for all $0 \le s \le t \le 1$ and for some constants $\bar p \ge 10$ and $\eta > 0$ with
$\bar p\eta \le 1 - \eta'$ and $2/\bar q + 10/\bar p \le 1$, where $\eta'$ is the constant from
Assumption B.2 and $\bar q$ the constant from Assumption B.1.

Remark 5.1 When we write $\|J^{(2)}\|$ we mean the operator norm from
$\mathcal{H} \otimes \mathcal{H} \to \mathcal{H}$, namely
$\sup_{\varphi,\psi\in\mathcal{H}} \|J^{(2)}(\varphi,\psi)\| / (\|\varphi\|\,\|\psi\|)$.

We finally assume that the Jacobian of the solution has some "smoothing properties"
in the sense that if we apply it to a function that belongs to the image of the
orthogonal complement $\Pi^\perp = 1 - \Pi$ of the projection operator $\Pi$ then, at least for
short times, its norm will on average be reduced:

Assumption B.4 (Smoothing) One has the bound

$$ \mathbf{E}\|J_{0,1}\Pi^\perp\|^{\bar p} \le \exp(\bar p\,\eta V(u_0) - \bar p\,C_\Pi) , $$

for some constant $C_\Pi$ such that $C_\Pi - C_J > 2\kappa C_L$, where $\kappa = \eta/(1-\eta')$.
The constants $\eta$ and $\bar p$ appearing in this bound are the same as the ones appearing in
Assumption B.3, the constant $\eta'$ is the same as the one appearing in Assumption B.2, and the
projection $\Pi$ is the same as the one appearing in Assumption B.1.

Remark 5.2 The condition $C_\Pi - C_J > 2\kappa C_L$ may seem particularly unmotivated. In
the next section, we try to give some insight into its meaning.

Remark 5.3 Notice that if $\mathrm{Range}(\Pi) \subset \mathrm{span}\{g_1, \ldots, g_d\}$, then
in light of the last representation in (4.11) it is reasonable to expect (5.1) to hold as long
as one has some control over moments of the modulus of continuity of $s \mapsto K_{s,t}$. (This
is made more precise in Lemma 6.18.) We refer to such an assumption on the range as the
"essentially elliptic" setting since all of the directions whose (pathwise) dynamics are not
controlled by Assumption B.4 are directly forced.

Under these assumptions we have the following result which is the fundamental
"smoothing" estimate of this paper. It is the linchpin on which all of the ergodic results
rest.

Theorem 5.4 Let Assumptions A.1 and B.1-B.4 hold. Then for any
$\zeta \in [0, (C_\Pi - C_J)/2 - \kappa C_L)$ there exists a positive constant $C$ such that for
all $n \in \mathbf{N}$ and $\varphi : \mathcal{H} \to \mathbf{R}$

$$ \|D(\mathcal{P}_{2n}\varphi)(u)\| \le e^{4\kappa V(u)}\Big( C\,U^2(u)\sqrt{(\mathcal{P}_{2n}\varphi^2)(u)} + \gamma^{2n}\sqrt{(\mathcal{P}_{2n}\|D\varphi\|^2)(u)} \Big) \tag{5.3} $$

where $\gamma = \exp(-\zeta)$.

Remark 5.5 By $(\mathcal{P}_t\|D\varphi\|^2)(u)$, we simply mean
$\mathbf{E}_u\big(\sup_{\|\xi\|=1} |(D\varphi)(u_t)\xi|^2\big)$.

Remark 5.6 If $\|\varphi\|_\infty$ or $\|D\varphi\|_\infty$ are bounded by one then the
corresponding terms under the square root are bounded by one. Furthermore, in light of
Assumption B.2, if $\varphi(u)^2 \le \exp(V(u))$, then

$$ \sqrt{(\mathcal{P}_{2n}\varphi^2)(u)} \le \|\varphi\|_\infty\sqrt{\mathbf{E}\exp(V(u_{2n}))} \le \|\varphi\|_\infty \exp\big(\eta' V(u)/2 + C_L/(2-2\eta')\big) . $$

Of course, the same bound holds for $\sqrt{(\mathcal{P}_{2n}\|D\varphi\|^2)(u)}$, provided that
one has an estimate of the type $\|D\varphi\|^2(u) \le \exp(V(u))$.

5.1 Motivating discussion

We now discuss in what sense (5.3) implies smoothing. When the term "smoothing" is
used in the mathematics literature to describe a linear operator $T$, it usually means that
$T\varphi$ belongs to a smoother function space than $\varphi$. This usually means that
$T\varphi$ is "more differentiable" than $\varphi$. A convenient way to express this fact
analytically would be an estimate of the form

$$ \|D(T\varphi)(u)\| \le C(u)\,\|\varphi\|_\infty . \tag{5.4} $$

(Of course the "smoothing" property may improve the smoothness by less than a whole
derivative, or one may consider functions $\varphi$ that are not bounded, but let us consider
(5.4) just for the sake of the argument.) This shows in a quantitative way that $T\varphi$ is
differentiable while $\varphi$ need not be. In light of Remark 5.6, this is in line with the
first term on the right hand side of (5.3).

The second term on the right hand side of (5.3) embodies smoothing of a different
type. Suppose that $T$ satisfies the estimate

$$ \|D(T\varphi)\|_\infty \le C\,\|\varphi\|_\infty + \gamma\,\|D\varphi\|_\infty \tag{5.5} $$

for some positive $C$ and some $\gamma \in (0,1)$. (Note that this is a variation of what is
usually referred to as the Lasota-Yorke inequality [LY73, Liv03] or the Ionescu-Tulcea-Marinescu
inequality [ITM50].) Though (5.5) does not imply that $T\varphi$ belongs to a smoother function
space than $\varphi$, it does imply that the gradients of $T\varphi$ are smaller than those of
$\varphi$, at least as long as the gradients of $\varphi$ are sufficiently steep. This is in
line with a more colloquial idea of smoothing, though not in line with the traditional
mathematical definition used.
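
To see why an estimate of the form (5.5) still gives regularity in the long run, one can iterate it. The following sketch assumes, in addition to (5.5), that $T$ is a contraction in the supremum norm, $\|T\varphi\|_\infty \le \|\varphi\|_\infty$ (true for a Markov operator, but an extra assumption relative to the text above):

```latex
\begin{align*}
\|D(T^n\varphi)\|_\infty
 &\le C\,\|T^{n-1}\varphi\|_\infty + \gamma\,\|D(T^{n-1}\varphi)\|_\infty \\
 &\le C\,\|\varphi\|_\infty \sum_{k=0}^{n-1}\gamma^k + \gamma^n\,\|D\varphi\|_\infty
 \;\le\; \frac{C}{1-\gamma}\,\|\varphi\|_\infty + \gamma^n\,\|D\varphi\|_\infty .
\end{align*}
```

The dependence on the initial gradient decays geometrically, which is the same two-term structure as the right hand side of (5.3).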

5.1.1 Strongly dissipative setting 

Where does the assumption $C_\Pi > C_J + 2\kappa C_L$ come from? This is easy to understand
if we consider the "trivial" case $\Pi = 0$. In this case, Assumption B.1 is empty and the
projection $\Pi^\perp$ is the identity. Therefore, the left hand sides from Assumptions B.3 and
B.4 coincide, so that one has $C_J = -C_\Pi$ and our restriction becomes $C_J + \kappa C_L < 0$.
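
Spelled out, the reduction in the case $\Pi = 0$ is a one-line substitution (a sketch; it only uses the identification $C_J = -C_\Pi$ made above):

```latex
C_\Pi - C_J > 2\kappa C_L
\quad\text{and}\quad C_\Pi = -C_J
\;\Longrightarrow\;
-2C_J > 2\kappa C_L
\;\Longleftrightarrow\;
C_J + \kappa C_L < 0 .
```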

This turns out to be precisely the right condition to impose if one wishes to show
that $\mathbf{E}\|J_{0,n}\| \to 0$ at an exponential rate:

Proposition 5.7 Let Assumptions B.2 and B.3 hold. Then, for any $p \in [0,\bar p]$, one has
the bound

$$ \mathbf{E}\|J_{0,n}\|^p \le \exp(p\kappa V(u_0) + p\,C_T\,n) , $$

with $\kappa = \eta/(1-\eta')$ and $C_T = C_J + \kappa C_L$.

Proof. Using the fact that $\|J_{0,n}\| \le \|J_{n-1,n}\|\,\|J_{0,n-1}\|$, it is easy to show
that the following recursion relation holds:

$$ \mathbf{E}\big(\exp(p\kappa V(u_n))\,\|J_{0,n}\|^p\big) \le e^{pC_T}\,\mathbf{E}\big(\exp(p\kappa V(u_{n-1}))\,\|J_{0,n-1}\|^p\big) . $$

It now suffices to apply this $n$ times and to use the fact that $\|J_{0,0}\| = 1$. □

We now use this estimate to prove a version of Theorem 5.4 when the system is
strongly dissipative:

Proposition 5.8 Let Assumptions B.2 and B.3 hold and set $C_T = C_J + \kappa C_L$ with
$\kappa = \eta/(1-\eta')$ as before. Then, for any $\varphi : \mathcal{H} \to \mathbf{R}$ and
$n \in \mathbf{N}$ one has

$$ \|D(\mathcal{P}_n\varphi)(u)\| \le \gamma^n e^{\kappa V(u)}\sqrt{(\mathcal{P}_n\|D\varphi\|^2)(u)} , $$

with $\gamma = e^{C_T}$. In particular, the semigroup $\mathcal{P}_t$ has the asymptotic strong
Feller property whenever $C_T < 0$.

Proof. Fixing any $\xi \in \mathcal{H}$ with $\|\xi\| = 1$, observe that

$$ D(\mathcal{P}_t\varphi)(u)\xi = \mathbf{E}_u\,(D\varphi)(u_t)J_{0,t}\xi \le \sqrt{\mathbf{E}_u\|J_{0,t}\|^2}\,\sqrt{\mathbf{E}_u\|D\varphi\|^2(u_t)} . $$

Applying Proposition 5.7 completes the proof. □
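
For the record, the constants in Proposition 5.8 come from taking $p = 2$ and $t = n$ in Proposition 5.7. The sketch below assumes $2 \le \bar p$, which holds since $\bar p \ge 10$:

```latex
\sqrt{\mathbf{E}_u\|J_{0,n}\|^2}
 \;\le\; \sqrt{\exp\bigl(2\kappa V(u) + 2\,C_T\,n\bigr)}
 \;=\; e^{\kappa V(u)}\,\bigl(e^{C_T}\bigr)^{n}
 \;=\; \gamma^n e^{\kappa V(u)},
 \qquad \gamma = e^{C_T} .
```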

Comparing this result to the bound (5.3) stated in Theorem 5.4 shows that the
combination of the smoothing Assumption B.4 with Assumption B.1 on the Malliavin
matrix allows us to consider the system as if its Jacobian was contracting at an average
rate $(C_\Pi - C_J)/2$ instead of expanding at a rate $C_J$. This is precisely the rate that
one would obtain by projecting the Jacobian with $\Pi^\perp$ at every second step. The
additional term containing $\mathcal{P}_{2n}\varphi^2$ appearing in the right hand side of (5.3)
should then be interpreted as the probabilistic "cost" of performing that projection. Since
this "projection" will be performed by using an approximate inverse to the Malliavin matrix,
it makes sense that the larger the lower bound on $\mathcal{M}_t$ is, the lower the
corresponding probabilistic cost.
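
The average rate quoted above can be read off from a heuristic two-step count. This is only a bookkeeping sketch: it ignores the Lyapunov correction $\kappa C_L$ and the cost of the projection, and it treats the exponents in Assumptions B.3 and B.4 as growth rates per unit time.

```latex
\underbrace{C_J}_{\text{unprojected step}}
 \;+\; \underbrace{(-C_\Pi)}_{\text{step projected by } \Pi^\perp}
 \;=\; -(C_\Pi - C_J)
 \qquad\Longrightarrow\qquad
 \text{average rate per unit time} \;=\; -\,\frac{C_\Pi - C_J}{2} .
```

Subtracting the correction $\kappa C_L$ per unit time gives the admissible range $\zeta < (C_\Pi - C_J)/2 - \kappa C_L$ appearing in Theorem 5.4.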

Remark 5.9 It is worth mentioning that nothing in this section required that the number
of Wiener processes be finite. Hence one is free to take $d = \infty$, as long as all of the
solutions and linearizations are well defined (which places conditions on the $g_k$).

5.2 Transfer of variation 

Having analyzed the strongly dissipative setting, we now turn to the general setting.
We would like to mimic the calculation used in Proposition 5.8 but we do not want
to require the system to be "contractive" in the sense of being strongly dissipative.
However, in settings where one can prove (5.4) there is usually no requirement of strong
dissipativity but rather an assumption of hypoellipticity. This is because the variation
in the initial condition is transferred to a variation in the Wiener space. Mirroring the
discussion in [Mat08, HM06] (where more details can be found), we begin by sketching
a proof of (5.4) and then show how to modify it to obtain (5.5). The central idea is
to compensate as much as possible the effect of an infinitesimal perturbation in the
initial condition by an infinitesimal variation in the driving Wiener process. In short, to
transfer one type of variation to another.

Denoting by $S = \{\xi \in \mathcal{H} : \|\xi\| = 1\}$ the set of possible directions in
$\mathcal{H}$, let there be given a map from $S \times C([0,\infty), \mathbf{R}^d) \to \mathcal{CM}'$
denoted by $(\xi, W) \mapsto h^\xi(W)$, mapping variations in the initial condition $u$ to
variations in the Wiener path $W$. We will worry about constructing a suitable map in the next
sections; for the moment we just explore which properties of $h^\xi$ might be useful. Fixing
$t$, let us begin by assuming that the following identity holds:

$$ D_\xi\Phi_t^u(W) = \langle D\Phi_t^u(W), \xi\rangle = \langle \mathcal{D}\Phi_t^u(W), h^\xi(W)\rangle = \mathcal{D}_{h^\xi}\Phi_t^u(W) . \tag{5.6} $$

(The first and last equalities are just changes in notation.) In words, the middle equality
states that the variation in $u_t(W)$ caused by an infinitesimal shift in the initial
condition in the direction $\xi$ is equal to the variation in $u_t$ caused by an infinitesimal
shift of the Wiener process $W$ in the direction $h^\xi(W)$. This is the basic reasoning behind
smoothness estimates proved by Malliavin calculus. We begin as in the proof of
Proposition 5.8. For any $\xi \in S$, one has that

$$ D_\xi\big(\varphi(\Phi_t^u)\big) = (D\varphi)(\Phi_t^u)\,D_\xi\Phi_t^u = (D\varphi)(\Phi_t^u)\,\mathcal{D}_{h^\xi}\Phi_t^u = \mathcal{D}_{h^\xi}\big(\varphi(\Phi_t^u)\big) . \tag{5.7} $$

Taking expectations and using the Malliavin integration by parts formula (4.6) to obtain
the last equality yields

$$ D_\xi\mathcal{P}_t\varphi(u) = \mathbf{E}\,D_\xi\big(\varphi(\Phi_t^u)\big) = \mathbf{E}\,\mathcal{D}_{h^\xi}\big(\varphi(\Phi_t^u)\big) = \mathbf{E}_u\,\varphi(u_t)\int_0^t h_s^\xi \cdot dW(s) . $$

Applying the Cauchy-Schwarz inequality to the last term produces a term of the form
of the first term on the right-hand side of (5.3) provided
$\mathbf{E}\big|\int_0^t h_s^\xi \cdot dW(s)\big|^2 < \infty$. Taken alone, provided one can find
a mapping $(\xi, W) \mapsto h^\xi(W)$ satisfying (5.6) with
$\mathbf{E}\big|\int_0^t h_s^\xi \cdot dW(s)\big|^2 < \infty$, we have proven an inequality of the
form (5.4).

In the infinite-dimensional SPDE setting of this paper, finding a map
$(\xi, W) \mapsto h^\xi(W)$ satisfying (5.6) seems hopeless, unless the noise is
infinite-dimensional itself and acts in a very non-degenerate way on the equation, see
[Mas89, DPEZ95, EH01] or the monograph [DPZ96] for some results in this direction. Instead, we
only "approximately compensate" for the variation due to differentiating in the initial
direction $\xi$ with a shift in the Wiener process. As such, given a mapping
$(\xi, W) \mapsto h^\xi(W)$, we replace the requirement in (5.6) with the definition

$$ \rho_t(W) = D_\xi\Phi_t^u(W) - \mathcal{D}_{h^\xi}\Phi_t^u(W) \tag{5.8} $$

and hope that we can choose $h^\xi$ in such a way that $\rho_t \to 0$ as $t \to \infty$. As
before, we postpone choosing a mapping $(\xi, W) \mapsto h^\xi(W)$ until the next section. For
the moment we are content to explore the implications of finding such a mapping with desirable
properties.

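Returning to (5.7) but using (5.8), we now have

$$ D_\xi\big(\varphi(\Phi_t^u)\big) = (D\varphi)(\Phi_t^u)\,D_\xi\Phi_t^u = (D\varphi)(\Phi_t^u)\,\mathcal{D}_{h^\xi}\Phi_t^u + (D\varphi)(\Phi_t^u)\,\rho_t = \mathcal{D}_{h^\xi}\big(\varphi(\Phi_t^u)\big) + (D\varphi)(\Phi_t^u)\,\rho_t . $$

Taking expectations of both sides and applying the Malliavin integration by parts to the
first term on the right-hand side produces

$$ D_\xi\mathcal{P}_t\varphi(u) = \mathbf{E}\,D_\xi\big(\varphi(\Phi_t^u)\big) = \mathbf{E}\,\varphi(\Phi_t^u)\int_0^t h_s^\xi \cdot dW(s) + \mathbf{E}\,(D\varphi)(\Phi_t^u)\,\rho_t $$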

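which in turn, after application of the Cauchy-Schwarz inequality twice, yields

$$ \|D\mathcal{P}_t\varphi(u)\| \le C(t)\sqrt{(\mathcal{P}_t\varphi^2)(u)} + \Gamma(t)\sqrt{(\mathcal{P}_t\|D\varphi\|^2)(u)} \tag{5.10} $$

with $C(t) = \sqrt{\mathbf{E}\big|\int_0^t h_s^\xi \cdot dW(s)\big|^2}$ and
$\Gamma(t) = \sqrt{\mathbf{E}|\rho_t|^2}$. Observe that provided that

$$ \limsup_{n\in\mathbf{N}} C(n) < \infty \quad\text{and}\quad \limsup_{n\in\mathbf{N}} \Gamma(n)\,\gamma^{-n} < \infty \tag{5.11} $$

for some $\gamma \in (0,1)$ we will have proved Theorem 5.4. Choosing a mapping
$(\xi, W) \mapsto h^\xi(W)$ so that these two conditions hold is the topic of the next four
sections.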
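
The two applications of the Cauchy-Schwarz inequality behind (5.10) can be written out as follows. This is a sketch using only the definitions of $C(t)$ and $\Gamma(t)$ given above, together with the pointwise bound $|(D\varphi)(\Phi_t^u)\rho_t| \le \|D\varphi\|(\Phi_t^u)\,|\rho_t|$:

```latex
\begin{align*}
\Bigl|\mathbf{E}\,\varphi(\Phi_t^u)\int_0^t h_s^\xi\cdot dW(s)\Bigr|
 &\le \sqrt{\mathbf{E}\,\varphi^2(\Phi_t^u)}\;
      \sqrt{\mathbf{E}\Bigl|\int_0^t h_s^\xi\cdot dW(s)\Bigr|^2}
  \;=\; C(t)\,\sqrt{(\mathcal{P}_t\varphi^2)(u)} , \\
\bigl|\mathbf{E}\,(D\varphi)(\Phi_t^u)\,\rho_t\bigr|
 &\le \sqrt{\mathbf{E}\,\|D\varphi\|^2(\Phi_t^u)}\;\sqrt{\mathbf{E}\,|\rho_t|^2}
  \;=\; \Gamma(t)\,\sqrt{(\mathcal{P}_t\|D\varphi\|^2)(u)} .
\end{align*}
```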
5.3 Choosing a variation $h^\xi$

As discussed in [HM06] and at length in [Mat08], if one looks for the variation $h^\xi$ such
that (5.6) holds and $\int_0^t |h_s^\xi|^2\,ds$ is minimized, then the answer is
$h_s^\xi = (\mathcal{A}_t^*\mathcal{M}_t^{-1}J_{0,t}\xi)(s)$, which by the observation in (4.5)
is simply $h_s^\xi = G^*K_{s,t}\mathcal{M}_t^{-1}J_{0,t}\xi$. While this is not quite the
correct optimisation problem to solve, since its solution $h^\xi$ is not adapted to $W$ and
hence $\mathbf{E}\big|\int_0^t h_s^\xi \cdot dW(s)\big|^2 \neq \int_0^t \mathbf{E}|h_s|^2\,ds$,
it is in general a good enough choice.
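
The formula for the minimiser is the usual least-norm solution of a linear constraint. The sketch below assumes that (5.6) can be written as $\mathcal{A}_t h = J_{0,t}\xi$ and that $\mathcal{M}_t = \mathcal{A}_t\mathcal{A}_t^*$ is invertible on the relevant range (which, as the text goes on to explain, is not guaranteed in the present setting). Any other solution is of the form $h^\star + k$ with $\mathcal{A}_t k = 0$:

```latex
\begin{align*}
h^\star &= \mathcal{A}_t^*\mathcal{M}_t^{-1}J_{0,t}\xi
 \quad\Longrightarrow\quad
 \mathcal{A}_t h^\star = \mathcal{A}_t\mathcal{A}_t^*\mathcal{M}_t^{-1}J_{0,t}\xi = J_{0,t}\xi , \\
\langle h^\star, k\rangle &= \langle \mathcal{M}_t^{-1}J_{0,t}\xi,\ \mathcal{A}_t k\rangle = 0
 \quad\Longrightarrow\quad
 \|h^\star + k\|^2 = \|h^\star\|^2 + \|k\|^2 \;\ge\; \|h^\star\|^2 .
\end{align*}
```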

A bigger problem is that the space on which $\mathcal{M}_t$ can be inverted is far from
evident. If the range of $G$ was dense in $\mathcal{H}$ (which requires infinitely many driving
Wiener processes), then there is some chance that
$\mathrm{Range}(J_{0,t}) \subset \mathrm{Range}(\mathcal{M}_t)$ and the above formula for $h_t$
could be used. This is in fact the case where the Bismut-Elworthy-Li formula is often used and
which might be referred to as "truly elliptic." In this case the system is in fact strong
Feller. We are precisely interested in the case when only a finite number of directions are
forced (or their variance decays so fast that it is effectively true). One of the fundamental
ideas used in this article is that we need only affect the system on a finite-dimensional
subspace since the dynamic pathwise control embodied in Assumption B.4 can control the
remaining degrees of freedom.

While Theorem 6.7 of the next section gives conditions that ensure that $\mathcal{M}_t$ is
almost surely non-degenerate, it does not give much insight into the structure of the
range since it only deals with finite-dimensional projections. However, Assumption B.1
ensures that it is unlikely that eigenvectors with sizable projection in $\Pi\mathcal{H}$ have
small eigenvalues. As long as this is true, the "regularised inverse"
$(\mathcal{M}_t + \beta)^{-1}$, which always exists since $\mathcal{M}_t$ is positive definite,
will be a "good inverse" for $\mathcal{M}_t$, at least on $\Pi\mathcal{H}$. This suggests that
we make the choice $h_s^\xi = G^*K_{s,t}(\mathcal{M}_t + \beta)^{-1}J_{0,t}\xi$, for some very
small $\beta > 0$. Observe that

$$ D_\xi u_t - \mathcal{D}_{h^\xi}u_t = J_{0,t}\xi - \mathcal{M}_t(\mathcal{M}_t + \beta)^{-1}J_{0,t}\xi = \beta(\mathcal{M}_t + \beta)^{-1}J_{0,t}\xi , \tag{5.12} $$

which will be expected to be small as long as $J_{0,t}\xi$ has small projection (relative to
the size of $\beta$) in $\Pi^\perp\mathcal{H}$. But in any case, the norm of the right hand
side in (5.12) will never exceed the norm of $J_{0,t}\xi$, so that for small values of $\beta$,
$\|D_\xi u_t - \mathcal{D}_{h^\xi}u_t\|$ is expected to behave like $\|\Pi^\perp J_{0,t}\xi\|$.
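
The behaviour of the operator in (5.12) is easiest to see in an eigenbasis. As a sketch, assume $\mathcal{M}_t e_j = \lambda_j e_j$ with $\lambda_j \ge 0$ (such a basis exists since $\mathcal{M}_t$ is compact, self-adjoint and positive; the symbols $e_j$, $\lambda_j$ are introduced here for illustration only):

```latex
\beta(\mathcal{M}_t+\beta)^{-1}e_j \;=\; \frac{\beta}{\lambda_j+\beta}\,e_j ,
\qquad
\frac{\beta}{\lambda_j+\beta} \approx
\begin{cases}
 0 & \text{if } \lambda_j \gg \beta , \\
 1 & \text{if } \lambda_j \ll \beta ,
\end{cases}
\qquad
0 < \frac{\beta}{\lambda_j+\beta} \le 1 .
```

Directions with large eigenvalues are almost fully compensated, directions with small eigenvalues are left essentially untouched, and the operator norm never exceeds one.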

Assumption B.4 precisely states that if one projects the Jacobian onto $\Pi^\perp\mathcal{H}$,
then the system behaves as if it was "strongly dissipative" as in Section 5.1.1. All together,
this motivates alternating between choosing
$h^\xi = \mathcal{A}^*_{n,n+1}(\mathcal{M}_{n,n+1} + \beta_n)^{-1}J_{n,n+1}\rho_n$
for even $n$ and $h^\xi = 0$ on $[n, n+1]$ for odd $n$.

Since we will split time into intervals of length one, we introduce the following
notations:

$$ J_n = J_{n,n+1} , \qquad \mathcal{A}_n = \mathcal{A}_{n,n+1} , \qquad \mathcal{M}_n = \mathcal{M}_{n,n+1} . $$

We then define the map $(\xi, W) \mapsto h^\xi(W)$ recursively by

$$ h_s^\xi = \begin{cases} \big(\mathcal{A}_{2n}^*(\beta_{2n} + \mathcal{M}_{2n})^{-1}J_{2n}\rho_{2n}\big)(s) & \text{for } s \in [2n, 2n+1) \text{ and } n \in \mathbf{N} , \\ 0 & \text{for } s \in [2n-1, 2n) \text{ and } n \in \mathbf{N} . \end{cases} \tag{5.13} $$

Here, as before, $\rho_0 = \xi$, $\rho_t = J_{0,t}\xi - \mathcal{A}_{0,t}h^\xi = D_\xi u_t - \mathcal{D}_{h^\xi}u_t$,
and $\beta_n$ is a sequence of positive random numbers measurable with respect to
$\mathcal{F}_n$ which will be chosen later.

Observe that these definitions are not circular since the construction of $h_s^\xi$ for
$s \in [n, n+1)$ only requires the knowledge of $\rho_n$, which in turn depends only on
$h_s^\xi$ for $s \in [0, n)$. The remainder of this section is devoted to showing that this
particular choice of $h^\xi$ is "good" in the sense that it allows us to satisfy (5.11). We are
going to assume throughout this section that Assumptions A.1 and B.1-B.4 hold, so that we are
in the setting of Theorem 5.4, and that $h^\xi$ is defined as in (5.13).

5.4 Preliminary bounds and definitions 

We start by stating a few straightforward consequences of Assumption B.2.

Proposition 5.10 For any $\alpha \le 1$, one has the bound

$$ \mathbf{E}\exp(\alpha V(u_1)) \le \exp(\alpha\eta' V(u_0) + \alpha C_L) . $$

Furthermore, for $\eta > 0$ and $p > 0$ such that $\eta p \le 1$, one has

$$ \mathbf{E}\exp(\eta p V(u_n)) \le \exp\big(p\eta(\eta')^n V(u_0) + p\kappa C_L\big) . $$

Finally, setting $\kappa = \eta/(1-\eta')$ as before, one has the bound

$$ \mathbf{E}\exp\Big(\eta p\sum_{k=0}^{n} V(u_k)\Big) \le \exp\big(p\kappa V(u_0) + p\kappa C_L n\big) , $$

provided that $\kappa p \le 1$.

Proof. The first bound follows immediately from Jensen's inequality. The second and
third inequalities are shown by rewriting the estimate from Assumption B.2 as

$$ \mathbf{E}\big(\exp(\eta p V(u_n)) \,\big|\, \mathcal{F}_{n-1}\big) \le \exp\big(\eta p\,\eta' V(u_{n-1}) + \eta p\,C_L\big) , $$

and iterating it. □
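
The iteration behind the third bound of Proposition 5.10 can be sketched as follows. The coefficients $a_j$ are introduced here for bookkeeping only; each step conditions on $\mathcal{F}_{n-j-1}$ and uses the first bound with $\alpha = a_j \le 1$.

```latex
\begin{align*}
a_j &:= \eta p\,\bigl(1 + \eta' + \dots + (\eta')^{j}\bigr)
      \;\le\; \frac{\eta p}{1-\eta'} \;=\; \kappa p \;\le\; 1 , \\
\mathbf{E}\exp\Bigl(\eta p\sum_{k=0}^{n}V(u_k)\Bigr)
 &\le e^{a_0 C_L}\,\mathbf{E}\exp\Bigl(\eta p\sum_{k=0}^{n-2}V(u_k) + a_1 V(u_{n-1})\Bigr) \\
 &\le \dots \le \exp\Bigl(a_n V(u_0) + C_L\sum_{j=0}^{n-1}a_j\Bigr)
 \;\le\; \exp\bigl(p\kappa V(u_0) + p\kappa C_L\,n\bigr) .
\end{align*}
```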

Similarly, we obtain a bound on the Jacobian and on the Malliavin derivative $\mathcal{A}_n$
of the solution flow between times $n$ and $n+1$:

Proposition 5.11 For any $p \in [0,\bar p]$, one has

$$ \sup_{n \le s \le t \le n+1} \mathbf{E}\|J_{s,t}\|^p \le \exp\big(p\eta(\eta')^n V(u_0) + pC_J + p\kappa C_L\big) \tag{5.14} $$

$$ \mathbf{E}\|\mathcal{A}_n\|^p \le \|G\|^p\exp\big(p\eta(\eta')^n V(u_0) + p\kappa C_L + pC_J\big) . \tag{5.15} $$

Furthermore, (5.14) also holds for $J^{(2)}$ with $C_J$ replaced by $C_J^{(2)}$.

Proof. We only need to show the bound for $p = \bar p$, since lower values follow again from
Jensen's inequality. The bound (5.14) is an immediate consequence of Assumption B.3
and Proposition 5.10. The second bound follows by writing

$$ \|\mathcal{A}_n h\|^{\bar p} = \Big\|\int_n^{n+1} J_{r,n+1}Gh_r\,dr\Big\|^{\bar p} \le \|G\|^{\bar p}\Big(\int_n^{n+1}\|J_{r,n+1}\|^2\,dr\Big)^{\bar p/2}\Big(\int_n^{n+1}|h_r|^2\,dr\Big)^{\bar p/2} \le \|G\|^{\bar p}\Big(\int_n^{n+1}\|J_{r,n+1}\|^{\bar p}\,dr\Big)\,|||h|||^{\bar p} , $$

and then applying the first bound. □

In addition to these first Malliavin derivatives, we will need control of the derivatives
of various objects involving the Malliavin derivative. The following lemma gives
control over two objects related to the second Malliavin derivative:

Lemma 5.12 For all $p \in [0, \bar p/2]$, one has the bounds

$$ \sup_{s,r\in[n,n+1]} \mathbf{E}\|\mathcal{D}_s^i J_{r,n+1}\|^p \le \exp\big(2p\eta(\eta')^n V(u_0) + 2p\kappa C_L + pC_J + pC_J^{(2)}\big) , $$

$$ \sup_{s\in[n,n+1]} \mathbf{E}\|\mathcal{D}_s^i\mathcal{A}_n\|^p \le \|G\|^p\exp\big(2p\eta(\eta')^n V(u_0) + 2p\kappa C_L + pC_J + pC_J^{(2)}\big) . $$

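Proof. For this, we note that by (4.9) one has the identities

$$ \mathcal{D}_s^i J_{r,n+1}\xi = \begin{cases} J^{(2)}_{s,n+1}(J_{r,s}\xi, g_i) & \text{for } r \le s , \\ J^{(2)}_{r,n+1}(J_{s,r}g_i, \xi) & \text{for } s \le r , \end{cases} \qquad \mathcal{D}_s^i\mathcal{A}_n v = \int_n^{n+1} \mathcal{D}_s^i J_{r,n+1}Gv_r\,dr . $$

Hence if $p \in [0, \bar p/2]$ (which by the way also ensures that $2p\eta \le 1$) it follows from
Proposition 5.11 that

$$ \mathbf{E}\|\mathcal{D}_s^i J_{r,n+1}\|^p \le \big(\mathbf{E}\|J^{(2)}_{s,n+1}\|^{2p}\,\mathbf{E}\|J_{r,s}\|^{2p}\big)^{1/2} \le \mathbf{E}\exp\big(2p\eta V(u_n) + pC_J + pC_J^{(2)}\big) \le \exp\big(2p\eta(\eta')^n V(u_0) + 2p\kappa C_L + pC_J + pC_J^{(2)}\big) $$

for $r \le s$ and similarly for $s \le r$. Since, for $p \ge 1$, we can write

$$ \mathbf{E}\|\mathcal{D}_s^i\mathcal{A}_n\|^p \le \|G\|^p\int_n^{n+1}\mathbf{E}\|\mathcal{D}_s^i J_{r,n+1}\|^p\,dr , $$

the second estimate then follows from the first one. □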
5.5 Controlling the error term $\rho_t$

The purpose of this section is to show that the "error term"
$\rho_t = D_\xi u_t - \mathcal{D}_{h^\xi}u_t$ goes to zero as $t \to \infty$, provided that the
"control" $h^\xi$ is chosen as explained in Section 5.3. We begin by observing that for even
integer times, $\rho_n$ is given recursively by

$$ \rho_{2n+2} = J_{2n+1}\rho_{2n+1} = J_{2n+1}\mathcal{R}^{\beta_{2n}}_{2n}J_{2n}\rho_{2n} , \tag{5.16} $$

where $\mathcal{R}^{\beta}_k$ is the operator

$$ \mathcal{R}^{\beta}_k = 1 - \mathcal{M}_k(\beta + \mathcal{M}_k)^{-1} = \beta(\beta + \mathcal{M}_k)^{-1} . $$

Observe that $\mathcal{R}^{\beta}_k$ measures the error between
$\mathcal{M}_k(\beta + \mathcal{M}_k)^{-1}$ and the identity, which we will see is small for
$\beta$ very small. This recursion is of the form $\rho_{2n+2} = \Xi_{2n+2}\rho_{2n}$, with the
(random) operator $\Xi_{2n+2} : \mathcal{H} \to \mathcal{H}$ defined by
$\Xi_{2n+2} = J_{2n+1}\mathcal{R}^{\beta_{2n}}_{2n}J_{2n}$. Notice that $\Xi_{2n}$ is
$\mathcal{F}_{2n}$-measurable and that $\Xi_k$ is defined only for even integers $k$.
Define the $n$-fold product of the $\Xi_{2k}$ by

$$ \Xi^{(2n)} = \prod_{k=1}^{n}\Xi_{2k} , $$

so that $\rho_{2n} = \Xi^{(2n)}\xi$.
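
The recursion (5.16) can be checked directly from the definition of $h^\xi$ on the two halves of $[2n, 2n+2]$. The sketch below assumes the one-step form $\rho_{m+1} = J_m\rho_m - \mathcal{A}_m h^\xi$ of the definition of $\rho$ on each unit interval, which follows from linearity and the composition property of the Jacobian:

```latex
\begin{align*}
\rho_{2n+1}
 &= J_{2n}\rho_{2n} - \mathcal{A}_{2n}\mathcal{A}_{2n}^*(\beta_{2n}+\mathcal{M}_{2n})^{-1}J_{2n}\rho_{2n}
  = \bigl(1 - \mathcal{M}_{2n}(\beta_{2n}+\mathcal{M}_{2n})^{-1}\bigr)J_{2n}\rho_{2n}
  = \mathcal{R}^{\beta_{2n}}_{2n}J_{2n}\rho_{2n} , \\
\rho_{2n+2}
 &= J_{2n+1}\rho_{2n+1}
 \qquad \bigl(h^\xi \equiv 0 \text{ on } [2n+1, 2n+2)\bigr) .
\end{align*}
```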

It is our aim to show that it is possible under the assumptions of Section 5 to choose
the sequence $\beta_n$ in an adapted way such that for a sufficiently small constant
$\tilde\eta$ and $p \in [0, \bar p/2]$ one has

$$ \mathbf{E}\|\rho_{2n}\|^p \le \mathbf{E}\big(\|\Xi^{(2n)}\|^p\big)\|\xi\|^p \le \exp\big(p\tilde\eta V(u_0) - pn\tilde\kappa\big)\|\xi\|^p \tag{5.17} $$

for some $\tilde\kappa > 0$. This will give the needed control over the last term in (5.10).

By Assumption B.1 we have a bound on the Malliavin covariance matrix of the form

$$ \mathbf{P}\Big( \inf_{\|\Pi\varphi\| \ge \alpha\|\varphi\|} \frac{\langle \varphi, \mathcal{M}_k\varphi\rangle}{\|\varphi\|^2} \le \varepsilon \,\Big|\, \mathcal{F}_k \Big) \le C(\alpha,p)\,U^p(u_k)\,\varepsilon^p . \tag{5.18} $$

Here, by the Markov property, the quantities $\varepsilon$ and $\alpha$ do not necessarily need
to be constant, but are allowed to be $\mathcal{F}_k$-measurable random variables.
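In order to obtain (5.17), the idea is to decompose $\Xi_{2n+2}$ as

$$ \Xi_{2n+2} = J_{2n+1}\mathcal{R}^{\beta_{2n}}_{2n}J_{2n} = (J_{2n+1}\Pi^\perp)\mathcal{R}^{\beta_{2n}}_{2n}J_{2n} + J_{2n+1}(\Pi\mathcal{R}^{\beta_{2n}}_{2n})J_{2n} =: I_{2n+2,1} + I_{2n+2,2} . \tag{5.19} $$

The crux of the matter is controlling the term $\Pi\mathcal{R}^{\beta_{2n}}_{2n}$ since
$J_{2n+1}\Pi^\perp$ is controlled by Assumption B.4 and we know that
$\|\mathcal{R}^{\beta_{2n}}_{2n}\| \le 1$. To understand and control the $I_{2n+2,2}$ term, we
explore the properties of a general operator of the form of $\mathcal{R}^{\beta_{2n}}_{2n}$.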

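Lemma 5.13 Let $\Pi$ be an orthogonal projection on $\mathcal{H}$ and $M$ be a self-adjoint,
positive linear operator on $\mathcal{H}$ satisfying for some $\gamma > 0$ and $\delta \in (0,1]$

$$ \inf_{\xi\in A_\delta} \frac{\langle M\xi, \xi\rangle}{\|\xi\|^2} \ge \gamma , \tag{5.20} $$

where $A_\delta = \{\xi : \|\Pi\xi\| \ge \delta\|\xi\|\}$. Then, defining
$R = 1 - M(\beta + M)^{-1} = \beta(\beta + M)^{-1}$ for some $\beta > 0$, one has
$\|\Pi R\| \le \delta \vee \sqrt{\beta/\gamma}$.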

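Proof. Since $\|R\| \le 1$, for $R\xi \in A_\delta^c$ one has

$$ \frac{\|\Pi R\xi\|}{\|\xi\|} \le \frac{\|\Pi R\xi\|}{\|R\xi\|} \le \delta . $$

Now for $R\xi \in A_\delta$, we have by assumption (5.20)

$$ \gamma\frac{\|\Pi R\xi\|^2}{\|\xi\|^2} \le \gamma\frac{\|R\xi\|^2}{\|\xi\|^2} \le \frac{\langle MR\xi, R\xi\rangle}{\|\xi\|^2} \le \frac{\langle (M+\beta)R\xi, R\xi\rangle}{\|\xi\|^2} = \beta\frac{\langle \xi, R\xi\rangle}{\|\xi\|^2} \le \beta . $$

Combining both estimates gives the required bound. □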
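
A two-dimensional example makes the lemma concrete. The sketch below uses a hypothetical diagonal operator with a single non-zero eigenvalue $m > 0$ on $\mathbf{R}^2$, with $\Pi$ the projection onto the first coordinate; neither is taken from the text.

```latex
\begin{gather*}
M = \begin{pmatrix} m & 0 \\ 0 & 0 \end{pmatrix}, \quad
\Pi = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}, \quad
R = \beta(\beta+M)^{-1} = \begin{pmatrix} \frac{\beta}{\beta+m} & 0 \\ 0 & 1 \end{pmatrix}, \quad
\|\Pi R\| = \frac{\beta}{\beta+m} , \\
\inf_{\xi\in A_\delta}\frac{\langle M\xi,\xi\rangle}{\|\xi\|^2}
 = \inf_{|x| \ge \delta\sqrt{x^2+y^2}} \frac{m\,x^2}{x^2+y^2}
 = m\delta^2 =: \gamma , \\
\frac{\beta}{\beta+m} \;\le\; \min\Bigl(1, \frac{\beta}{m}\Bigr)
 \;\le\; \delta \vee \frac{1}{\delta}\sqrt{\frac{\beta}{m}}
 \;=\; \delta \vee \sqrt{\beta/\gamma} .
\end{gather*}
```

The last inequality follows by distinguishing the cases $\beta/m \le \delta^2$ and $\beta/m > \delta^2$. Note that $R$ acts as the identity on the unforced direction, which is exactly why only $\Pi R$, and not $R$ itself, can be made small.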
This result can be applied almost directly to our setting in the following way:

Corollary 5.14 Let $M(\omega)$ be a random operator satisfying the conditions of Lemma 5.13
almost surely for some random variable $\gamma$. If we choose $\beta$ such that, for some
(deterministic) $\delta \in (0,1)$, $p \ge 1$ and $C > 0$, one has the bound
$\mathbf{P}(\beta \ge \delta^2\gamma) \le C\delta^p$, then
$\mathbf{E}\|\Pi R\|^p \le (1 + C)\delta^p$.

In particular, for any $\delta \in (0,1)$, setting

$$ \beta_k = \frac{\delta^3}{U(u_k)\,C(\delta,\bar p)^{1/\bar p}} \tag{5.21} $$

where $C$ is the constant from (5.18), produces the bound
$\mathbf{E}\big(\|\Pi\mathcal{R}^{\beta_{2n}}_{2n}\|^p \,\big|\, \mathcal{F}_{2n}\big) \le 2\delta^p$,
valid for every $p \le \bar p$.

Proof. To see the first part define
$\Omega_0 = \{\omega : \beta(\omega) < \delta^2\gamma(\omega)\}$. It then follows from
Lemma 5.13, the fact that $\|\Pi R\| \le 1$ and the assumption
$\mathbf{P}(\Omega_0^c) \le C\delta^p$, that

$$ \mathbf{E}\|\Pi R\|^p \le \mathbf{E}\Big(\big(\delta \vee \sqrt{\beta/\gamma}\big)^p\,\mathbf{1}_{\Omega_0} + \mathbf{1}_{\Omega_0^c}\Big) \le \delta^p + \mathbf{P}(\Omega_0^c) \le (1 + C)\delta^p , \tag{5.22} $$

as required.

To obtain the second statement, it is sufficient to consider (5.18) with
$\varepsilon = \beta_{2n}/\delta^2$, so that one can take for $\gamma$ the random variable equal
to $\varepsilon$ on the set for which the bound (5.18) holds and $0$ on its complement. It then
follows from the choice (5.21) for $\beta_{2n}$ that the assumptions for the first part are
satisfied with $C = 1$ and $p = \bar p$, so that the statement follows. □
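
The role of the particular scaling in (5.21) can be checked by substitution into (5.18). This is a sketch; it reads the denominator in (5.21) as $U(u_k)\,C(\delta,\bar p)^{1/\bar p}$ and takes $\alpha = \delta$, $p = \bar p$ in (5.18).

```latex
\varepsilon = \frac{\beta_k}{\delta^2} = \frac{\delta}{U(u_k)\,C(\delta,\bar p)^{1/\bar p}}
\quad\Longrightarrow\quad
C(\delta,\bar p)\,U^{\bar p}(u_k)\,\varepsilon^{\bar p}
 = C(\delta,\bar p)\,U^{\bar p}(u_k)\,\frac{\delta^{\bar p}}{U^{\bar p}(u_k)\,C(\delta,\bar p)}
 = \delta^{\bar p} .
```

So the exceptional set has conditional probability at most $\delta^{\bar p}$, which is the hypothesis of the first part with $C = 1$. Exponents $p \le \bar p$ then follow from Jensen's inequality, since $(2\delta^{\bar p})^{p/\bar p} \le 2\delta^p$.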

We now introduce a "compensator"

$$ \chi_{2n+2} = \exp\big(\eta V(u_{2n+1}) + \eta V(u_{2n})\big) , $$

and, in analogy to before, we set $\chi^{(2n)} = \prod_{k=1}^{n}\chi_{2k}$. Proposition 5.10
implies that for any $p \in [0,\bar p]$

$$ \mathbf{E}\big(\chi^{(2n)}\big)^p \le \exp\big(p\kappa V(u_0) + 2n\,p\kappa C_L\big) , \tag{5.23} $$

where $\kappa = \eta/(1-\eta')$. Note that Assumption B.3 made sure that $\eta$ is sufficiently
small so that $\kappa\bar p \le 1$. With these preliminaries complete, we now return to the
analysis of $\Xi_{2n}$.

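Lemma 5.15 For any $\varepsilon > 0$ and $p \in [0, \bar p/2]$, there exists a $\delta > 0$
sufficiently small so that if one chooses $\beta_n$ as in Corollary 5.14 and $\eta$ such that
$\kappa\bar p \le 1$, one has

$$ \mathbf{E}\big(\|\Xi_{2n+2}\|^p\chi_{2n+2}^{-p} \,\big|\, \mathcal{F}_{2n}\big) \le \exp(pC_J - pC_\Pi + \varepsilon p) . $$

Proof. Since for every $\varepsilon > 0$ there exists a constant $C_\varepsilon$ such that
$|x+y|^p \le e^{p\varepsilon/2}|x|^p + C_\varepsilon^p|y|^p$, recalling the definition of
$I_{2n+2,1}$ and $I_{2n+2,2}$ from (5.19) we have that

$$ \mathbf{E}\big(\|\Xi_{2n+2}\|^p\chi_{2n+2}^{-p} \,\big|\, \mathcal{F}_{2n}\big) \le e^{\varepsilon p/2}\,\mathbf{E}\big(\|I_{2n+2,1}\|^p\chi_{2n+2}^{-p} \,\big|\, \mathcal{F}_{2n}\big) + C_\varepsilon^p\,\mathbf{E}\big(\|I_{2n+2,2}\|^p\chi_{2n+2}^{-p} \,\big|\, \mathcal{F}_{2n}\big) . $$

We begin with the first term since it is the most straightforward one. Using the fact
that $\|\mathcal{R}^{\beta_{2n}}_{2n}\| \le 1$ and that $\bar p\eta \le 1 - \eta'$ by the
assumption on $\eta$, we obtain from Assumptions B.2 and B.3 that

$$ \mathbf{E}\big(\|I_{2n+2,1}\|^p\chi_{2n+2}^{-p} \,\big|\, \mathcal{F}_{2n}\big) \le \exp(-p\eta V(u_{2n}))\,\mathbf{E}\Big(\mathbf{E}\big(\|J_{2n+1}\Pi^\perp\|^p \,\big|\, \mathcal{F}_{2n+1}\big)\exp(-p\eta V(u_{2n+1}))\,\|J_{2n}\|^p \,\Big|\, \mathcal{F}_{2n}\Big) \le \exp(pC_J - pC_\Pi) . $$

Turning to the second term, we obtain for any $\delta \in (0,1)$ the bound

$$ \mathbf{E}\big(\|I_{2n+2,2}\|^p\chi_{2n+2}^{-p} \,\big|\, \mathcal{F}_{2n}\big) \le \exp(-p\eta V(u_{2n}))\sqrt{\mathbf{E}\big(\|\Pi\mathcal{R}^{\beta_{2n}}_{2n}\|^{2p} \,\big|\, \mathcal{F}_{2n}\big)}\,\sqrt{\mathbf{E}\Big(\mathbf{E}\big(\|J_{2n+1}\|^{2p} \,\big|\, \mathcal{F}_{2n+1}\big)\exp(-2p\eta V(u_{2n+1}))\,\|J_{2n}\|^{2p} \,\Big|\, \mathcal{F}_{2n}\Big)} \le \exp(2pC_J)\,\delta^p\sqrt{2} , $$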

provided that we choose $\beta_n$ as in Corollary 5.14. Choosing now $\delta$ sufficiently small (it
suffices to choose it such that S p < ^£=C~ p e~ pC '~ pCn for every p < p/2) we obtain 
the desired bound. □ 
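
One way to make the smallness requirement on $\delta$ explicit is to insert the two bounds, $\exp(pC_J - pC_\Pi)$ for the first term and $\exp(2pC_J)\,\delta^p\sqrt{2}$ for the second, into the splitting with constants $e^{\varepsilon p/2}$ and $C_\varepsilon^p$. The following is a sketch derived from those two bounds, not necessarily the exact constant used in the source:

```latex
e^{\varepsilon p/2}e^{pC_J - pC_\Pi} + C_\varepsilon^{p}\sqrt{2}\,e^{2pC_J}\,\delta^p
 \;\le\; e^{pC_J - pC_\Pi + \varepsilon p}
\quad\Longleftrightarrow\quad
\delta^p \;\le\; \frac{e^{\varepsilon p} - e^{\varepsilon p/2}}{\sqrt{2}}\;
 C_\varepsilon^{-p}\,e^{-pC_J - pC_\Pi} .
```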

Combining Lemma 5.15 with (5.23), we obtain the needed result which ensures
that the "error term" $\rho_t$ from (5.10) goes to zero.

Lemma 5.16 For any $p \in [0, \bar p/4]$ and $\tilde\kappa \in [0, C_\Pi - C_J - 2\kappa C_L)$
there exists a choice of the $\beta_n$ of the form (5.21) so that

$$ \mathbf{E}\|\Xi^{(2n+2)}\|^p \le \exp\big(p\kappa V(u_0) - p\tilde\kappa n\big) , $$

for all $u_0 \in \mathcal{H}$.

Proof. Since

$$ \mathbf{E}\|\Xi^{(2n+2)}\|^p \le \Big(\mathbf{E}\big(\|\Xi^{(2n+2)}\|^{2p}(\chi^{(2n+2)})^{-2p}\big)\Big)^{1/2}\Big(\mathbf{E}\big(\chi^{(2n+2)}\big)^{2p}\Big)^{1/2} , $$

the result follows by combining Lemma 5.15 with (5.23). □
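
The rate in Lemma 5.16 can be traced as follows. The sketch iterates Lemma 5.15 with exponent $2p \le \bar p/2$ over the $n+1$ factors of $\Xi^{(2n+2)}$ (using submultiplicativity of the operator norm and the tower property), and uses (5.23) with exponent $2p$ and $n$ replaced by $n+1$:

```latex
\begin{align*}
\mathbf{E}\bigl(\|\Xi^{(2n+2)}\|^{2p}(\chi^{(2n+2)})^{-2p}\bigr)
 &\le \exp\bigl(2p\,(n+1)\,(C_J - C_\Pi + \varepsilon)\bigr) , \\
\mathbf{E}\bigl(\chi^{(2n+2)}\bigr)^{2p}
 &\le \exp\bigl(2p\kappa V(u_0) + 2p\kappa C_L\,(2n+2)\bigr) , \\
\mathbf{E}\|\Xi^{(2n+2)}\|^{p}
 &\le \exp\bigl(p\kappa V(u_0) - p\,(n+1)\,(C_\Pi - C_J - 2\kappa C_L - \varepsilon)\bigr) .
\end{align*}
```

Choosing $\varepsilon$ so that $\tilde\kappa \le C_\Pi - C_J - 2\kappa C_L - \varepsilon$ gives the stated bound, which is where the condition $C_\Pi - C_J > 2\kappa C_L$ of Assumption B.4 enters.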

5.6 Controlling the size of the variation $h^\xi$

We now turn to controlling the size of

$$ \mathbf{E}\Big|\int_0^{2n}\langle h_s^\xi, dW_s\rangle\Big|^2 \tag{5.24} $$

uniformly as $n \to \infty$. We assume throughout this section that $h^\xi$ was constructed as
in Section 5.3 with $\beta_n$ as in (5.21).

Since our choice of $h_s^\xi$ is not adapted to the $W_s$, this does not follow from a simple
application of Itô's isometry. However, the situation is not as bad as it could be, since
the control is "block adapted." By this we mean that $h_n$ is adapted to $\mathcal{F}_n$ for
every integer value of $n$. For non-integer values $t \in (n, n+1]$, $h_t$ has no reason to be
$\mathcal{F}_t$-measurable in general, but it is nevertheless $\mathcal{F}_{n+1}$-measurable.
The stochastic integral in (5.24) is accordingly not an Itô integral, but a Skorokhod integral.
Hence to estimate (5.24) we must use its generalization given in (4.7) which produces

$$ \mathbf{E}\Big|\int_0^{2n}\langle h_s^\xi, dW_s\rangle\Big|^2 \le \mathbf{E}\,|||h^\xi|||^2_{[0,2n]} + \sum_{k=0}^{n}\int_{2k}^{2k+1}\int_{2k}^{2k+1}\mathbf{E}\|\mathcal{D}_s h_t\|^2_{HS}\,ds\,dt \tag{5.25} $$

where $|||f|||_I^2 = \int_I |f(s)|^2\,ds$ and $\|M\|_{HS}$ denotes the Hilbert-Schmidt norm on
linear operators from $\mathbf{R}^d$ to $\mathbf{R}^d$. We see the importance of the "block
adapted" structure of $h_s$. If not for this structure, the integrand appearing in the second
term above would need to decay both in $s$ and $t$ to be finite.

The main result of this section is

Proposition 5.17 Let Assumptions B.1-B.4 hold. Then, if one chooses $\beta_n$ as in (5.21),
there exists a constant $C > 0$ such that

$$ \lim_{n\to\infty}\mathbf{E}\Big|\int_0^{2n}\langle h_s^\xi, dW_s\rangle\Big|^2 \le C\exp\big((8\eta + 2\kappa)V(u_0)\big)\,U^2(u_0)\,\|\xi\|^2 . $$



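Proof of Proposition 5.17. In the interest of brevity we will set
$\tilde{\mathcal{M}}_n = \mathcal{M}_n + \beta_n$ and $I_n = [n, n+1]$. We will also write
$|||\cdot|||_I$ for the norm on $L^2(I, \mathbf{R}^d)$ viewed as a subset of $\mathcal{CM}'$ and
we will use $\|\cdot\|$ and $|||\cdot|||_I$ to denote respectively the induced operator norm
on linear maps from $\mathcal{H}$ to $\mathcal{H}$ and from $\mathcal{CM}'$ to $\mathcal{H}$.
Hopefully without too much confusion, we will also use $|||\cdot|||_I$ to denote the induced
operator norm on linear maps from $\mathcal{H}$ to $\mathcal{CM}'$. In all cases, we will
further abbreviate $|||h|||_{I_n}$ to $|||h|||_n$.

Observe now that the definitions of $\tilde{\mathcal{M}}_n$ and $\mathcal{A}_n$ imply the
following almost sure bounds:

$$ |||\tilde{\mathcal{M}}_n^{-1/2}\mathcal{A}_n|||_n \le 1 , \qquad |||\mathcal{A}_n^*\tilde{\mathcal{M}}_n^{-1/2}|||_n \le 1 , \qquad \|\tilde{\mathcal{M}}_n^{-1/2}\| \le \beta_n^{-1/2} . \tag{5.26} $$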
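
The bounds in (5.26) follow from the operator inequality $\mathcal{M}_n \le \mathcal{M}_n + \beta_n$. As a sketch, using $\mathcal{M}_n = \mathcal{A}_n\mathcal{A}_n^*$ and the fact that an operator $T$ and its adjoint have the same norm, with $\|T\|^2 = \|TT^*\|$:

```latex
\begin{gather*}
\bigl(\tilde{\mathcal{M}}_n^{-1/2}\mathcal{A}_n\bigr)\bigl(\tilde{\mathcal{M}}_n^{-1/2}\mathcal{A}_n\bigr)^*
 = \tilde{\mathcal{M}}_n^{-1/2}\,\mathcal{M}_n\,\tilde{\mathcal{M}}_n^{-1/2}
 = 1 - \beta_n\tilde{\mathcal{M}}_n^{-1} \;\le\; 1 , \\
\tilde{\mathcal{M}}_n = \mathcal{M}_n + \beta_n \;\ge\; \beta_n
 \quad\Longrightarrow\quad
 \|\tilde{\mathcal{M}}_n^{-1/2}\| \le \beta_n^{-1/2} .
\end{gather*}
```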

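We start by bounding the first term on the right hand side of (5.25). Observe that

$$ |||h^\xi|||^2_{[0,2n]} = \sum_{k=0}^{n}|||h^\xi|||^2_{2k} . \tag{5.27} $$

Using the bound on $\mathcal{A}^*\tilde{\mathcal{M}}^{-1}$ from (5.26), we obtain

$$ |||h^\xi|||_{2k} = |||\mathcal{A}_{2k}^*\tilde{\mathcal{M}}_{2k}^{-1}J_{2k}\rho_{2k}|||_{2k} \le \beta_{2k}^{-1/2}\,\|J_{2k}\|\,\|\rho_{2k}\| . $$

By our assumption that $10/\bar p + 2/\bar q \le 1$ we can find $1/q + 1/r + 1/p = 1$ with
$q \le \bar q$, $2r \le \bar p$ and $2p \le \bar p$. By the Hölder inequality we thus have

$$ \mathbf{E}\,|||h^\xi|||^2_{2k} \le \big(\mathbf{E}\beta_{2k}^{-q}\big)^{1/q}\big(\mathbf{E}\|J_{2k}\|^{2r}\big)^{1/r}\big(\mathbf{E}\|\rho_{2k}\|^{2p}\big)^{1/p} . $$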

From Proposition 5.11, Assumption B.1 and Lemma 5.16 we obtain the existence of a
positive constant $C$ (depending only on the choice made for $\tilde\kappa$ and on the bounds
given by our standing assumptions) such that one has the bounds

$$ \big(\mathbf{E}\|J_{2k}\|^{2r}\big)^{1/r} \le \exp\big(2\eta(\eta')^{2k}V(u_0) + 2C_J + 2\kappa C_L\big) , $$
$$ \big(\mathbf{E}\beta_{2k}^{-q}\big)^{1/q} \le C\,U(u_0) , \tag{5.28} $$
$$ \big(\mathbf{E}\|\rho_{2k}\|^{2p}\big)^{1/p} \le \exp\big(2\kappa V(u_0) - 2\tilde\kappa k\big)\|\xi\|^2 . $$

Combining these bounds and summing over $k$ yields

$$ \mathbf{E}\,|||h^\xi|||^2_{[0,2n]} \le C\,U(u_0)\exp\big(2(\eta + \kappa)V(u_0)\big)\|\xi\|^2 , \tag{5.29} $$

uniformly in $n \ge 0$.
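
Uniformity in $n$ comes from the exponential decay of the bound on $\rho_{2k}$. As a sketch, assuming the decay rate is chosen strictly positive, $\tilde\kappa > 0$, and using $\eta(\eta')^{2k} \le \eta$ to bound the $V(u_0)$ exponent uniformly in $k$:

```latex
\sum_{k=0}^{n} e^{-2\tilde\kappa k}
 \;\le\; \sum_{k=0}^{\infty} e^{-2\tilde\kappa k}
 \;=\; \frac{1}{1 - e^{-2\tilde\kappa}} \;<\; \infty .
```

The resulting factor is absorbed into the constant $C$ in (5.29).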

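We now turn to bound the second term on the right hand side of (5.25). Since the
columns of the matrix representation of the integrand are just $\mathcal{D}_s^i h_t$, the $i$th
component of the Malliavin derivative, we have

$$ \int_{2k}^{2k+1}\int_{2k}^{2k+1}\|\mathcal{D}_s h_t\|^2_{HS}\,ds\,dt = \sum_{i=1}^{d}\int_{2k}^{2k+1}|||\mathcal{D}_s^i h|||^2_{2k}\,ds . \tag{5.30} $$

From the definition of $h_t$, Lemma 5.12, the relation
$\tilde{\mathcal{M}}_{2k} = \mathcal{A}_{2k}\mathcal{A}_{2k}^* + \beta_{2k}$, and the fact that
both $\rho_{2k}$ and $\beta_{2k}$ are $\mathcal{F}_{2k}$-measurable, we have that for fixed
$s \in I_{2k}$, $\mathcal{D}_s^i h$ is an element of $L^2(I_{2k}, \mathbf{R}^d) \subset \mathcal{CM}'$
with:

$$ \mathcal{D}_s^i h = (\mathcal{D}_s^i\mathcal{A}_{2k}^*)\tilde{\mathcal{M}}_{2k}^{-1}J_{2k}\rho_{2k} + \mathcal{A}_{2k}^*\tilde{\mathcal{M}}_{2k}^{-1}(\mathcal{D}_s^i J_{2k})\rho_{2k} - \mathcal{A}_{2k}^*\tilde{\mathcal{M}}_{2k}^{-1}\big((\mathcal{D}_s^i\mathcal{A}_{2k})\mathcal{A}_{2k}^* + \mathcal{A}_{2k}(\mathcal{D}_s^i\mathcal{A}_{2k}^*)\big)\tilde{\mathcal{M}}_{2k}^{-1}J_{2k}\rho_{2k} . \tag{5.31} $$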
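
The last term in (5.31) is the product rule combined with the usual formula for the derivative of an inverse. As a sketch (the vanishing of $\mathcal{D}_s^i\beta_{2k}$ and $\mathcal{D}_s^i\rho_{2k}$ uses that both are $\mathcal{F}_{2k}$-measurable while $s \in I_{2k}$):

```latex
\begin{align*}
\mathcal{D}_s^i\bigl(\tilde{\mathcal{M}}_{2k}^{-1}\bigr)
 &= -\,\tilde{\mathcal{M}}_{2k}^{-1}\bigl(\mathcal{D}_s^i\tilde{\mathcal{M}}_{2k}\bigr)\tilde{\mathcal{M}}_{2k}^{-1} , \\
\mathcal{D}_s^i\tilde{\mathcal{M}}_{2k}
 &= (\mathcal{D}_s^i\mathcal{A}_{2k})\mathcal{A}_{2k}^* + \mathcal{A}_{2k}(\mathcal{D}_s^i\mathcal{A}_{2k}^*)
 \qquad \bigl(\mathcal{D}_s^i\beta_{2k} = 0 \text{ for } s \in I_{2k}\bigr) .
\end{align*}
```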

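For brevity we suppress the subscripts $k$ on the operators and norms for a moment. It
then follows from (5.26) that one has the almost sure bounds

$$ |||\tilde{\mathcal{M}}^{-1}\mathcal{A}||| \le \|\tilde{\mathcal{M}}^{-1/2}\|\,|||\tilde{\mathcal{M}}^{-1/2}\mathcal{A}||| \le \beta^{-1/2} , $$
$$ |||\mathcal{A}^*\tilde{\mathcal{M}}^{-1}||| \le |||\mathcal{A}^*\tilde{\mathcal{M}}^{-1/2}|||\,\|\tilde{\mathcal{M}}^{-1/2}\| \le \beta^{-1/2} , $$
$$ |||(\mathcal{D}_s^i\mathcal{A}^*)\tilde{\mathcal{M}}^{-1}J||| \le |||\mathcal{D}_s^i\mathcal{A}^*|||\,\|\tilde{\mathcal{M}}^{-1}\|\,\|J\| \le \beta^{-1}\,|||\mathcal{D}_s^i\mathcal{A}|||\,\|J\| , $$
$$ |||\mathcal{A}^*\tilde{\mathcal{M}}^{-1}(\mathcal{D}_s^i J)||| \le |||\mathcal{A}^*\tilde{\mathcal{M}}^{-1}|||\,\|\mathcal{D}_s^i J\| \le \beta^{-1/2}\,\|\mathcal{D}_s^i J\| . $$

In particular, this yields the bounds

$$ |||\mathcal{A}^*\tilde{\mathcal{M}}^{-1}(\mathcal{D}_s^i\mathcal{A})\mathcal{A}^*\tilde{\mathcal{M}}^{-1}J||| \le |||\mathcal{A}^*\tilde{\mathcal{M}}^{-1}|||\,|||\mathcal{D}_s^i\mathcal{A}|||\,|||\mathcal{A}^*\tilde{\mathcal{M}}^{-1}|||\,\|J\| \le \beta^{-1}\,|||\mathcal{D}_s^i\mathcal{A}|||\,\|J\| , $$
$$ |||\mathcal{A}^*\tilde{\mathcal{M}}^{-1}\mathcal{A}(\mathcal{D}_s^i\mathcal{A}^*)\tilde{\mathcal{M}}^{-1}J||| \le |||\mathcal{A}^*\tilde{\mathcal{M}}^{-1}\mathcal{A}|||\,|||\mathcal{D}_s^i\mathcal{A}^*|||\,\|\tilde{\mathcal{M}}^{-1}\|\,\|J\| \le \beta^{-1}\,|||\mathcal{D}_s^i\mathcal{A}|||\,\|J\| . $$

Applying all of these estimates to (5.31) we obtain the bound

$$ |||\mathcal{D}_s^i h|||_{2k} \le 3\beta_{2k}^{-1}\,|||\mathcal{D}_s^i\mathcal{A}_{2k}|||_{2k}\,\|J_{2k}\|\,\|\rho_{2k}\| + \beta_{2k}^{-1/2}\,\|\mathcal{D}_s^i J_{2k}\|\,\|\rho_{2k}\| . $$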

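The assumption that $10/\bar p + 2/\bar q \le 1$ ensures that we can find $q \le \bar q/2$,
$r \le \bar p/2$ and $p \le \bar p/4$ with $1/r + 2/p + 1/q = 1$. Applying Hölder's inequality
to the preceding products yields:

$$ \mathbf{E}\,|||\mathcal{D}_s^i h|||^2_{2k} \le 18\big(\mathbf{E}\beta_{2k}^{-2q}\big)^{1/q}\big(\mathbf{E}\|J_{2k}\|^{2r}\big)^{1/r}\big(\mathbf{E}|||\mathcal{D}_s^i\mathcal{A}_{2k}|||^{2p}\,\mathbf{E}\|\rho_{2k}\|^{2p}\big)^{1/p} + 2\big(\mathbf{E}\beta_{2k}^{-q}\big)^{1/q}\big(\mathbf{E}\|\mathcal{D}_s^i J_{2k}\|^{2p}\,\mathbf{E}\|\rho_{2k}\|^{2p}\big)^{1/p} . $$

We now use previous estimates to control each term. From Lemma 5.12 and
Proposition 5.11 we have the bounds

$$ \big(\mathbf{E}\|\mathcal{D}_s^i J_{2k}\|^{2p}\big)^{1/p} \le \exp\big(4\eta(\eta')^{2k}V(u_0) + 4\kappa C_L + 2C_J + 2C_J^{(2)}\big) , $$
$$ \big(\mathbf{E}|||\mathcal{D}_s^i\mathcal{A}_{2k}|||^{2p}\big)^{1/p} \le \|G\|^2\exp\big(4\eta(\eta')^{2k}V(u_0) + 4\kappa C_L + 2C_J + 2C_J^{(2)}\big) . $$

Recall furthermore the bounds on $\rho_{2k}$ and $J_{2k}$ already mentioned in (5.28). Lastly,
from Assumption B.1 we have that, similarly as before, there exists a positive constant
$C$ such that

$$ \big(\mathbf{E}\beta_{2k}^{-q}\big)^{1/q} \le \big(\mathbf{E}\beta_{2k}^{-2q}\big)^{1/q} \le C\,U^2(u_0) . $$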
Combining all of these estimates produces

$$ \sum_{i=1}^{d}\int_{2k}^{2k+1}\mathbf{E}\,|||\mathcal{D}_s^i h|||^2_{2k}\,ds \le C\exp\big((8\eta + 2\kappa)V(u_0)\big)\,U^2(u_0) , $$

for some different constant $C$ depending only on $C_J$, $C_J^{(2)}$, $C_L$, $\eta$, $\kappa$,
$\tilde\kappa$ and the choice of $\delta$ in (5.21). Combining this estimate with (5.29) and
(5.25) concludes the proof. □

6 Spectral properties of the Malliavin matrix

The results in this section build on the ideas and techniques from [MP06] and [BM07].
In the first, the specific case of the 2D Navier-Stokes equations was studied using similar
ideas. The time reversed representation of the Malliavin matrix used there is also
the basis of our analysis here (see also [Oco88]). In the context of the 2D Navier-Stokes
equations, a result analogous to Theorem 6.7 was proven. As here, one of the
key results needed is a connection between the typical size of a non-adapted Wiener
polynomial and the typical size of its coefficients. In [MP06], since the non-linearity
was quadratic, only Wiener polynomials of degree one were considered and the calculations
and formulation were made in a coordinate dependent fashion. In [BM07], the
calculations were reformulated in a basis free fashion which made possible both the
extension to more complicated non-linearities and the inclusion of forcing which was
not diagonal in the chosen basis. Furthermore in [BM07], a result close to Theorem 6.7
was claimed. Unfortunately, the auxiliary Lemma 9.12 in that article contains a mistake,
which left the proof of this result incomplete.

That being said, the techniques and presentation used in this and the next section build
on and refine those from [BM07]. One technical, but important, distinction between
Theorem 6.7 and the preceding versions is that Theorem 6.7 allows for rougher test
functions. This is accomplished by allowing $K_{t,T}$ to have a singularity in a certain
interpolation norm as $t \to T$. See equation (6.3) for the precise form. This extension
is important in correcting an error in [HM06], which requires control of the Malliavin
matrix of the type given by Theorem 6.7, that is, with test functions rougher than those
allowed in [MP06]. Indeed, the second inequality in equation (4.25) of [HM06] is not
justified, since the operator $M_0$ is only selfadjoint in $L^2$ and not in $H^1$. Theorem 6.7
rectifies the situation by dropping the requirement to work with $H^1$ completely.

6.1 Bounds on the dynamic 

As the previous sections have shown, it is sufficient to have control on the moments
of $u$ and $J$ in $\mathcal{H}$ to control their moments in many stronger norms. This motivates the
next assumption. For the entirety of this section we fix a $T_0 > 0$.

Assumption C.1 There exists a continuous function $\Psi_0 \colon \mathcal{H} \to [1,\infty)$ such that, for
every $T \in (0, T_0]$ and every $p \ge 1$ there exists a constant $C$ such that

\[
\mathbf{E} \sup_{T \le t \le 2T} \|u_t\|^p \le C\,\Psi_0^p(u_0)\;, \qquad
\mathbf{E} \sup_{T \le s < t \le 2T} \|J_{s,t}\|^p \le C\,\Psi_0^p(u_0)\;,
\]

for every $u_0 \in \mathcal{H}$. Here, $\|J\|$ denotes the operator norm of $J$ from $\mathcal{H}$ to $\mathcal{H}$.

Under this assumption, we immediately obtain control over the adjoint $K_{s,t}$.

Proposition 6.1 Under Assumption C.1, for every $T > 0$ and every $p \ge 1$ there exists
a constant $C$ such that

\[
\mathbf{E} \sup_{T \le s < t \le 2T} \|K_{s,t}\|^p \le C\,\Psi_0^p(u_0)\;,
\]

for every $u_0 \in \mathcal{H}$.

Proof. By Proposition 3.9 we know that $K_{s,t}$ is the adjoint of $J_{s,t}$ in $\mathcal{H}$. Combined
with Assumption C.1, this implies the result. □

In the remainder of this section, we will study the solution to (3.1) away from $t = 0$ and
up to some terminal time $T$ which we fix from now on. We also introduce the interval
$I_\delta = [T/2, T - \delta]$ for some $\delta \in (0, T/4]$ to be determined later. Given $u_t$ a solution to
(3.1), we also define a process $v_t$ by $v_t = u_t - GW(t)$, which is more regular in time.
Using Assumption C.1 and the a priori estimates from the previous sections, we obtain:



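Proposition 6.2 Let Assumption C.1 hold and $\Psi_0$ be the function introduced there.
For any fixed $\gamma < \gamma_\star$ and $\beta < \beta_\star$ there exists a positive $q$ so that if $\Psi = \Psi_0^q$ then the
solutions to (3.1) satisfy the following bounds for every initial condition $u_0 \in \mathcal{H}$:

\[
\mathbf{E} \sup_{t \in I_\delta} \|u_t\|_{\gamma+1}^p \le C_p \Psi^p(u_0)\;, \tag{6.1a}
\]
\[
\mathbf{E} \sup_{t \in I_\delta} \|\partial_t v_t\|_{\gamma}^p \le C_p \Psi^p(u_0)\;. \tag{6.1b}
\]

Furthermore, its linearization $J_{s,t}$ is bounded by

\[
\mathbf{E} \sup_{s < t \in I_\delta} \sup_{\|\varphi\| \le 1} \|J_{s,t}\varphi\|_{\gamma+1}^p \le C_p \Psi^p(u_0)\;, \tag{6.2a}
\]
\[
\mathbf{E} \sup_{s < t \in I_\delta} \sup_{\|\varphi\| \le 1} \|\partial_t J_{s,t}\varphi\|_{\gamma}^p \le C_p \Psi^p(u_0)\;. \tag{6.2b}
\]

Finally, the adjoint $K_{s,t}$ to the linearization satisfies the bounds

\[
\mathbf{E} \sup_{t \in I_\delta} \sup_{\|\varphi\| \le 1} \|K_{t,T}\varphi\|_{\beta+1}^p \le \frac{C_p \Psi^p(u_0)}{\delta^{p_\beta p}}\;, \tag{6.3a}
\]
\[
\mathbf{E} \sup_{t \in I_\delta} \sup_{\|\varphi\| \le 1} \|\partial_t K_{t,T}\varphi\|_{\beta}^p \le \frac{C_p \Psi^p(u_0)}{\delta^{p_\beta p}}\;, \tag{6.3b}
\]

where $p_\beta$ is as in Proposition 3.8. In all these bounds, $C_p$ is a constant depending only
on $p$ and on the details of the equation (3.1).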



Remark 6.3 One can assume without loss of generality, and we will do so from now
on, that the exponent $q$ defining $\Psi$ is greater than or equal to $n$, the degree of the
nonlinearity. This will be useful in the proof of Lemma 6.16 below.

Proof. It follows immediately from Assumption C.1 that

\[
\mathbf{E} \sup_{t \in [T/4, T]} \|u_t\|^p \le C\,\Psi_0^p(u_0)\;.
\]

Combining this with Proposition 3.5 yields the first of the desired bounds with $q = p_\gamma$.
Here, $\Psi_0$ is as in Assumption C.1 and $p_\gamma$ is as in Proposition 3.5.

Turning to the bound on $\partial_t v_t$, observe that $v$ satisfies the random PDE

\[
\partial_t v_t = F(v_t + GW(t)) = F(u_t)\;, \qquad v_0 = u_0\;.
\]

It follows at once from Proposition 3.5 and Assumption A.1.2 that the quoted estimate
holds with $q = p_\gamma + 1$. More precisely, it follows from Proposition 3.5 that
$u_t \in \mathcal{H}_\alpha$ for every $\alpha < \gamma_\star + 1$. Therefore, $L u_t \in \mathcal{H}_\gamma$ for $\gamma < \gamma_\star$. Furthermore,
$N \in \mathrm{Poly}(\mathcal{H}_{\gamma+1}, \mathcal{H}_\gamma)$ by Assumption A.1.2, so that $N(u_t) \in \mathcal{H}_\gamma$ as well. The claim
then follows from the a priori bounds obtained in Proposition 3.5.

Concerning the bound (6.2b) on the linearization $J_{s,t}$, Proposition 3.6 combined
with Assumption C.1 proves the result with $q = q_\gamma + 1$. The line of reasoning used
to bound $\|\partial_t v_t\|_\gamma$ also controls $\|\partial_t J_{s,t}\|_\gamma$ for $s < t$ and $s, t \in I_\delta$, since
$\partial_t J_{s,t} = -L J_{s,t} + DN(u_t) J_{s,t}$.

Since Proposition 6.1 gives a completely analogous bound for $K_{s,t}$ in $\mathcal{H}$ as for $J_{s,t}$,
the results on $K$ follow from the a priori bounds in Proposition 3.8. □

6.2 A Hormander-like theorem in infinite dimensions 

In this section, we are going to formulate a lower bound on the Malliavin covariance
matrix $\mathcal{M}_t$ under a condition that is very strongly reminiscent of the bracket condition
in Hörmander's celebrated "sums of squares" theorem [Hör85, Hör67]. The proof of
the result presented in this section will be postponed until Section 6.3 and constitutes
the main technical result of this work.

Throughout all of this section and Section 6.3 we are going to make use of the
bounds outlined in Proposition 6.2. We therefore now fix once and for all some choice
of constants

\[
\gamma \in [-a, \gamma_\star) \quad\text{and}\quad \beta \in [-a, \beta_\star) \quad\text{satisfying}\quad \gamma + \beta \ge -1\;. \tag{6.4}
\]

From now on, we will only ever use Proposition 6.2 with this fixed choice for $\gamma$ and $\beta$.
This is purely a convenience for expositional clarity since we will need these bounds
only finitely many times. As a side remark, note that one should think of these constants
as being arbitrarily close to $\gamma_\star$ and $\beta_\star$ respectively.
With $\gamma$ and $\beta$ fixed as in (6.4), we introduce the set

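\[
\mathrm{Poly}(\gamma, \beta) := \mathrm{Poly}(\mathcal{H}_\gamma, \mathcal{H}_{-\beta-1}) \cap \mathrm{Poly}(\mathcal{H}_{\gamma+1}, \mathcal{H}_{-\beta}) \tag{6.5}
\]

for notational convenience. (For integer $m$, $\mathrm{Poly}^m(\gamma, \beta)$ is defined analogously.) A
polynomial $Q \in \mathrm{Poly}(\gamma, \beta)$ is said to be admissible if

\[
[Q_\alpha, F_\sigma] \in \mathrm{Poly}(\gamma, \beta)\;,
\]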

for every pair of multi-indices $\alpha$, $\sigma$. Here, $Q_\alpha$ and $F_\sigma$ are defined as in (3.4) and $F$ is
the drift term of the SPDE (3.1) defined in (3.2).

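This definition allows us to define a family of increasing subsets $A_i \subset \mathrm{Poly}(\gamma, \beta)$
by the following recursion:

\[
A_1 = \{g_k,\ k = 1, \ldots, d\} \subset \mathcal{H}_{\gamma_\star+1} \approx \mathrm{Poly}^0(\mathcal{H}_{\gamma_\star+1}) \subset \mathrm{Poly}(\gamma, \beta)\;,
\]
\[
A_{i+1} = A_i \cup \{Q_\alpha,\ [F_\sigma, Q_\alpha] : Q \in A_i,\ Q \text{ admissible, and } \alpha, \sigma \text{ multi-indices}\}\;.
\]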



Remark 6.4 Recall from (3.5) that $Q_\alpha$ is proportional to the iterated "Lie bracket" of
$Q$ with $g_{\alpha_1}$, $g_{\alpha_2}$ and so forth. Similarly, $[F_\sigma, Q_\alpha]$ is the Lie bracket between two
different iterated Lie brackets. As such, except for the issue of admissibility, the set of
brackets considered here is exactly the same as in the traditional statement of Hörmander's
theorem; only the order in which they appear is slightly different.
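To each $A_N$ we associate a positive symmetric quadratic form-valued function $\mathcal{Q}_N$ by

\[
\langle \varphi, \mathcal{Q}_N(u)\varphi \rangle = \sum_{Q \in A_N} \langle \varphi, Q(u) \rangle^2\;.
\]

Lastly, for $\alpha \in (0, 1)$ and for a given orthogonal projection $\Pi \colon \mathcal{H} \to \mathcal{H}$, we define
$S_\alpha \subset \mathcal{H}$ by

\[
S_\alpha = \{\varphi \in \mathcal{H} \setminus \{0\} : \|\Pi\varphi\| \ge \alpha \|\varphi\|\}\;. \tag{6.6}
\]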
With this notation, we make the following non-degeneracy assumption:

Assumption C.2 For every $\alpha > 0$, there exists $N > 0$ and a function $\Lambda_\alpha \colon \mathcal{H} \to [0, \infty)$
such that

\[
\inf_{\varphi \in S_\alpha} \frac{\langle \varphi, \mathcal{Q}_N(u)\varphi \rangle}{\|\Pi\varphi\|^2} \ge \Lambda_\alpha^2(u)\;,
\]

for every $u \in \mathcal{H}_a$. Furthermore, for every $p \ge 1$, $t > 0$ and every $\alpha \in (0, 1)$, there
exists $C$ such that $\mathbf{E}\,\Lambda_\alpha^{-p}(u_t) \le C\,\Psi^p(u_0)$ for every initial condition $u_0 \in \mathcal{H}$.



Remark 6.5 Assumption C.2 is in some sense weaker than the usual non-degeneracy
condition of Hörmander's theorem, since it only requires $\mathcal{Q}_N$ to be sufficiently
non-degenerate on the range of $\Pi$. In particular, if $\Pi = 0$, then Assumption C.2 is void and
always holds with $\Lambda_\alpha = 1$, say. This is the reason why, by choosing for $\Pi$ a projector
onto some finite-dimensional subspace of $\mathcal{H}$, one can expect Assumption C.2 to hold
for a finite value of $N$, even in our situation where $A_N$ only contains finitely many
elements.

Remark 6.6 As will be seen in Section 8, it is often possible to choose $\Lambda_\alpha$ to be a
constant, so that the second part of Assumption C.2 is automatically satisfied.

When Assumption C.2 holds, we have the following result, whose proof is given in
Section 6.3.



Theorem 6.7 Consider an SPDE of the type (3.1) such that Assumptions A.1 and C.1
hold. Let furthermore the Malliavin matrix $\mathcal{M}_t$ be defined as in (4.10) and $S_\alpha$ as in
(6.6). Let $\Pi$ be a finite rank orthogonal projection satisfying Assumption C.2. Then,
there exists $\theta > 0$ such that, for every $\alpha \in (0, 1)$, every $p \ge 1$ and every $t > 0$ there
exists a constant $C$ such that the bound

\[
\mathbf{P}\Bigl( \inf_{\varphi \in S_\alpha} \frac{\langle \varphi, \mathcal{M}_t \varphi \rangle}{\|\varphi\|^2} \le \varepsilon \Bigr) \le C\,\Psi^{\theta p}(u_0)\,\varepsilon^p\;,
\]

holds for every $u_0 \in \mathcal{H}$ and every $\varepsilon \le 1$.

Remark 6.8 If $\Pi$ is a finite rank orthogonal projection satisfying Assumption C.2, then
Theorem 6.7 provides the critical ingredient to prove the smoothness of the density
of $(\mathcal{P}_t^* \delta_x)\Pi^{-1}$ with respect to Lebesgue measure. Though [BM07] contains a few
unfortunate errors, it still provides the framework needed to deduce smoothness of
these densities from Theorem 6.7. In particular, one needs to prove that $\Pi u_t$ is infinitely
Malliavin differentiable. Section 5.1 of [BM07] shows how to accomplish this in a
setting close to ours, see also [MP06].
6.3 Proof of Theorem 6.7

While the aim of this section is to prove Theorem 6.7, we begin with some preliminary
definitions which will simplify its presentation. Many of the arguments used will rely
on the construction of "exceptional sets" of small probability outside of which certain
intuitive implications hold. This justifies the introduction of the following notational
shortcut:

Definition 6.9 Given a collection $H = \{H^\varepsilon\}_{\varepsilon \le 1}$ of subsets of the ambient probability
space $\Omega$, we will say that "$H$ is a family of negligible events" if, for every $p \ge 1$ there
exists a constant $C_p$ such that $\mathbf{P}(H^\varepsilon) \le C_p \varepsilon^p$ for every $\varepsilon \le 1$.

Given such a family $H$ and a statement $\Phi_\varepsilon$ depending on a parameter $\varepsilon > 0$, we
will say that "$\Phi_\varepsilon$ holds modulo $H$" if, for every $\varepsilon \le 1$, the statement $\Phi_\varepsilon$ holds on the
complement of $H^\varepsilon$.

We will say that the family $H$ is "universal" if it does not depend on the problem
at hand. Otherwise, we will indicate which parameters it depends on.

Given two families $H_1$ and $H_2$ of negligible sets, we write $H = H_1 \cup H_2$ as a
shortcut for the sentence "$H^\varepsilon = H_1^\varepsilon \cup H_2^\varepsilon$ for every $\varepsilon \le 1$." Let us state the following
useful fact, the proof of which is immediate:

Lemma 6.10 Let $H_n^\varepsilon$ be a collection of events with $n \in \{1, \ldots, C\varepsilon^{-\kappa}\}$ for some
arbitrary but fixed constants $C$ and $\kappa$ and assume that $\mathbf{P}(H_k^\varepsilon) = \mathbf{P}(H_\ell^\varepsilon)$ for any pair
$(k, \ell)$. Then, if the family $\{H_1^\varepsilon\}$ is negligible, the family $\{H^\varepsilon\}$ defined by $H^\varepsilon = \bigcup_n H_n^\varepsilon$
is also negligible.

Remark 6.11 The same statement also holds of course if the equality between
probabilities of events is replaced by two-sided bounds with multiplicative constants that do
not depend on $k$, $\ell$, and $\varepsilon$.
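
The counting behind Lemma 6.10 is a union bound: if each of at most $C\varepsilon^{-\kappa}$ events has
probability at most $C_p \varepsilon^p$, then the union has probability at most $C\,C_p\,\varepsilon^{p-\kappa}$, so a
target exponent $p'$ is reached by using $p = p' + \kappa$ for the individual events. A minimal
Python sketch of this bookkeeping follows; the function names and the sample constants
are illustrative assumptions and do not appear in the text.

```python
import math

def union_bound(eps, p, C, kappa, C_p):
    """Upper bound on P(union of H_n), n = 1..floor(C * eps**-kappa),
    assuming each event satisfies P(H_n) <= C_p * eps**p."""
    n_events = math.floor(C * eps ** (-kappa))
    return n_events * C_p * eps ** p

def needed_exponent(p_target, kappa):
    """Exponent to use for the individual events so that the union
    still decays like eps**p_target."""
    return p_target + kappa
```

For instance, with $\kappa = 2$ one pays two powers of $\varepsilon$ for the union, which is harmless
because negligibility requires the bound for every $p$.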

An important particular case is when the family $H$ depends on the initial condition
$u_0$ to (3.1). We will then say that $H$ is "$\Psi$-controlled" if the constant $C_p$ can be bounded
by $\tilde C_p \Psi^p(u_0)$, where $\tilde C_p$ is independent of $u_0$.

In this language, the conclusion of Theorem 6.7 can be restated as saying that there
exists $\theta > 0$ such that, for every $\alpha > 0$, the event

\[
\Bigl\{ \inf_{\varphi \in S_\alpha} \frac{\langle \varphi, \mathcal{M}_T \varphi \rangle}{\|\varphi\|^2} \le \varepsilon \Bigr\}
\]

is a $\Psi^\theta$-controlled family of negligible events. Recall that the terminal time $T$ was
fixed once and for all and that the function $\Psi$ was defined in Proposition 6.2. We
further restate this as an implication in the following theorem, which is easily seen to
be equivalent to Theorem 6.7.

Theorem 6.12 Let $\Pi$ be a finite rank orthogonal projection satisfying Assumption C.2.
Then, there exists $\theta > 0$ such that for every $\alpha \in (0, 1)$, the implication

\[
\varphi \in S_\alpha \quad\Longrightarrow\quad \langle \varphi, \mathcal{M}_T \varphi \rangle > \varepsilon \|\varphi\|^2
\]

holds modulo a $\Psi^\theta$-controlled family of negligible events.
6.4 Basic structure and idea of proof of Theorem 6.12

We begin with an overly simplified version of the argument which neglects some
technical difficulties. The basic idea of the proof is to argue that if $\langle \mathcal{M}_T \varphi, \varphi \rangle$ is small
then $\langle \mathcal{Q}_k(u_T)\varphi, \varphi \rangle$ must also be small (with high probability) for every $k > 0$. This is
proved inductively, beginning with the directions which are directly forced, namely
those belonging to $A_1$. Assumption C.2 then guarantees in turn that $\|\Pi\varphi\|$ must be
small with high probability. On the other hand, since $\varphi \in S_\alpha$, we know for a fact that
$\|\Pi\varphi\| \ge \alpha\|\varphi\|$, which is not small. Hence one of the highly improbable events must
have occurred.

This sketch of proof is essentially the same as that of Hörmander's theorem in finite
dimensions, see [Mal78, KS84, KS85a, Nor86, Nua95]. Trying to adapt this argument
to the infinite-dimensional case, one is rapidly faced with two major hurdles. First,
processes of the form $t \mapsto \langle J_{t,T}\, g, \varphi \rangle$ appearing in the definition of $\mathcal{M}_T$ are not adapted
to the filtration generated by the driving noise. In finite dimensions, this difficulty is
overcome by noting that

\[
\mathcal{M}_t = J_{0,t}\, \hat{\mathcal{M}}_t\, J_{0,t}^*\;, \qquad \hat{\mathcal{M}}_t = \int_0^t J_{0,s}^{-1}\, G G^* \bigl(J_{0,s}^{-1}\bigr)^* \, ds\;,
\]

and then working with $\hat{\mathcal{M}}_t$ instead of $\mathcal{M}_t$. ($\hat{\mathcal{M}}_t$ is often called the reduced Malliavin
covariance matrix.) The processes $t \mapsto \langle J_{0,t}^{-1} g, \varphi \rangle$ appearing there are now perfectly
nice semimartingales and one can use Norris' lemma [Nor86], which is a quantitative
version of the Doob-Meyer decomposition theorem, to show inductively that if
$\langle \varphi, \hat{\mathcal{M}}_t \varphi \rangle$ is small, then $t \mapsto \langle J_{0,t}^{-1} Q(u_t), \varphi \rangle$ must be small for every vector field
$Q \in A_k$. In our setting, unlike in some previous results for infinite-dimensional
systems [BT05], the Jacobian $J_{0,t}$ is not invertible. This is a basic feature of dissipative
PDEs with a smoothing linear term which is the dominating term on the right hand
side. Such dynamical systems only generate semi-flows as opposed to invertible flows.

Even worse, there appears to be no good theory characterising a large enough subset
belonging to its range. The only other situations to our knowledge where this has
been overcome previously are the linear case [Oco88], as well as the particular case of
the two-dimensional Navier-Stokes equations on the torus [MP06] and [BM07] for
a setting close to ours. As in those settings, we do not attempt to define something like
the operator $\hat{\mathcal{M}}_t$ mentioned above but instead we work directly with $\mathcal{M}_t$, so that we
do not have Norris' lemma at our disposal. It will be replaced by the result from
Section 7 on "Wiener polynomials." This result states that if one considers a polynomial
where the variables are independent Wiener processes and the coefficients are arbitrary
(possibly non-adapted) Lipschitz continuous stochastic processes, then the polynomial
being small implies that with high probability each individual monomial is small. It
will be shown in this section how it is possible to exploit the polynomial structure of
our nonlinearity in order to replace Norris' lemma by such a statement.

Another slightly less serious drawback of working in an infinite-dimensional setting
is that we encounter singularities at $t = 0$ and at $t = T$ (for the operator $J_{t,T}$). Recall
the definition of the time interval $I_\delta = [T/2, T - \delta]$ from Section 3. We will work on
this interval, which is strictly included in $[0, T]$, to avoid these singularities. There will
be a trade-off between larger values of $\delta$ that make it easy to avoid the singularity and
smaller values of $\delta$ that make it easier to infer bounds for $\langle \mathcal{Q}_k(u_T)\varphi, \varphi \rangle$.

When dealing with non-adapted processes, it is typical to replace certain standard
arguments which hinge on adaptedness by arguments which use local time-regularity
properties instead. This was also the approach used in [MP06, BM07]. To this end we
introduce the following Hölder norms. For $\theta \in (0, 1]$, we define the Hölder norm for
functions $f \colon I_\delta \to \mathcal{H}$ by

\[
|||f|||_\theta = \sup_{s,t \in I_\delta} \frac{\|f(s) - f(t)\|}{|t - s|^\theta}\;, \tag{6.7}
\]

and similarly if $f$ is real-valued. (Note that even though we use the same notation as for
the norm in the Cameron-Martin space in the previous section, these have nothing to do
with each other. Since on the other hand the Cameron-Martin norm is never used in the
present section, we hope that this does not cause too much confusion.) We furthermore
set

\[
|||f|||_{\theta,\gamma} = \sup_{s,t \in I_\delta} \frac{\|f(s) - f(t)\|_\gamma}{|t - s|^\theta}\;,
\]

where $\|\cdot\|_\gamma$ denotes the $\gamma$th interpolation norm defined in Assumption A.1. Finally,
we are from now on going to assume that $\delta$ is a function of $\varepsilon$ through a scaling relation
of the type

\[
\delta = \frac{T}{4}\, \varepsilon^r \tag{6.8}
\]

for some (very small) value of $r$ to be determined later.

6.5 Some preliminary calculations 

We begin with two preliminary calculations. The first translates a given growth of the
moments of a family of random variables into a statement saying that the variables are
"small," modulo a negligible family of events. As such, it is essentially a translation of
Chebyshev's inequality into our language. The second is an interpolation result which
controls the supremum of a function's derivative by the supremum of the function and
the size of some Hölder coefficient.



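Lemma 6.13 Let $\delta$ be as in (6.8) with $r > 0$, let $\Psi \colon \mathcal{H} \to [1, \infty)$ be an arbitrary
function, and let $X_\delta$ be a $\delta$-dependent family of random variables such that there exists
$b \in \mathbf{R}$ ($b$ is allowed to be negative) such that, for every $p \ge 1$, $\mathbf{E}|X_\delta|^p \le C_p \Psi^p(u_0)\,\delta^{-bp}$.
Then, for any $q > br$ and any $c > 0$, the family of events

\[
\Bigl\{ |X_\delta| > \frac{\varepsilon^{-q}}{c} \Bigr\}
\]

is $\Psi^{\frac{1}{q - br}}$-dominated negligible.

Proof. It follows from Chebyshev's inequality that

\[
\mathbf{P}\Bigl( |X_\delta| > \frac{\varepsilon^{-q}}{c} \Bigr) \le C_p\, c^p\, \Psi^p\, \delta^{-bp}\, \varepsilon^{qp} = C_\ell \bigl(\Psi^{\frac{1}{q - br}}\bigr)^\ell \varepsilon^\ell\;,
\]

where $C_\ell$ is equal to $C_p c^p$ (up to a factor depending only on $T$, $b$ and $p$ that comes
from (6.8)) with $\ell = p(q - br)$. Provided that $q - br > 0$, this holds for
every $\ell > 0$ and the claim follows. □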
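
The exponent bookkeeping in this proof can be checked numerically. The sketch below
assumes the scaling $\delta = (T/4)\varepsilon^r$ of (6.8) and compares the Chebyshev bound
$c^p \varepsilon^{qp}\,\mathbf{E}|X_\delta|^p$ with its rewritten form in terms of $\ell = p(q - br)$. All function
names and sample values are illustrative assumptions, not part of the text.

```python
def chebyshev_tail_bound(C_p, Psi, c, b, q, r, p, eps, T=1.0):
    """Markov/Chebyshev bound for P(|X| > eps**-q / c), assuming the moment
    bound E|X|^p <= C_p * Psi**p * delta**(-b*p) with delta = (T/4)*eps**r."""
    delta = (T / 4.0) * eps ** r
    moment_bound = C_p * Psi ** p * delta ** (-b * p)
    return c ** p * eps ** (q * p) * moment_bound

def rewritten_bound(C_p, Psi, c, b, q, r, p, eps, T=1.0):
    """Same quantity written as C_ell * (Psi**(1/(q-b*r)))**ell * eps**ell
    with ell = p*(q - b*r); requires q - b*r > 0."""
    gap = q - b * r
    if gap <= 0:
        raise ValueError("need q > b*r")
    ell = p * gap
    C_ell = C_p * c ** p * (T / 4.0) ** (-b * p)  # includes the T-dependent factor
    return C_ell * (Psi ** (1.0 / gap)) ** ell * eps ** ell
```

Both functions return the same number, which makes explicit that the control function
is $\Psi^{1/(q-br)}$ and that the decay rate $\ell$ can be made arbitrarily large by increasing $p$.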

Lemma 6.14 Let $f \colon [0, T] \to \mathbf{R}$ be continuously differentiable and let $\alpha \in (0, 1]$.
Then, there exists a constant $C$ depending only on $\alpha$ such that

\[
\|\partial_t f\|_{L^\infty} \le C\, \|f\|_{L^\infty} \max\Bigl\{ \frac{1}{T},\ \|f\|_{L^\infty}^{-\frac{1}{1+\alpha}}\, \|\partial_t f\|_{C^\alpha}^{\frac{1}{1+\alpha}} \Bigr\}\;.
\]

Here, $\|f\|_{C^\alpha}$ denotes the best $\alpha$-Hölder constant for $f$.

Proof. Denote by $x_0$ a point such that $|\partial_t f(x_0)| = \|\partial_t f\|_{L^\infty}$. It follows from the
definition of the $\alpha$-Hölder constant $\|\partial_t f\|_{C^\alpha}$ that $|\partial_t f(x)| \ge \frac12 \|\partial_t f\|_{L^\infty}$ for every $x$
such that $|x - x_0| \le \bigl(\|\partial_t f\|_{L^\infty} / 2\|\partial_t f\|_{C^\alpha}\bigr)^{1/\alpha}$. The claim then follows from the fact
that if $|\partial_t f(x)| \ge A$ over an interval $I$, then there exists a point $x_1$ in the interval such
that $|f(x_1)| \ge A|I|/2$. □
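
As a sanity check of the interpolation inequality in Lemma 6.14, the following sketch
evaluates both sides in closed form for two simple functions with $\alpha = 1$. The value
$C = 4$ used here is our own choice, obtained by following the argument of the proof; the
text only asserts the existence of some constant. The function names are hypothetical.

```python
import math

def interpolation_rhs(sup_f, holder_df, T, alpha, C=4.0):
    """C * ||f|| * max{1/T, ||f||^(-1/(1+alpha)) * [f']_alpha^(1/(1+alpha))}."""
    if sup_f == 0.0:
        return 0.0  # f vanishes identically, hence so does f'
    exponent = 1.0 / (1.0 + alpha)
    return C * sup_f * max(1.0 / T, sup_f ** (-exponent) * holder_df ** exponent)

def sine_example(omega, T):
    """f(t) = sin(omega*t) on [0, T] with omega*T >= pi/2 and alpha = 1."""
    if omega * T < math.pi / 2:
        raise ValueError("need omega*T >= pi/2 so that sup|f| = 1")
    sup_f = 1.0          # attained at t = pi/(2*omega)
    sup_df = omega       # |f'(0)| = omega
    lip_df = omega ** 2  # Lipschitz constant of f', i.e. sup|f''|
    return sup_df, interpolation_rhs(sup_f, lip_df, T, alpha=1.0)

def linear_example(T):
    """f(t) = t on [0, T]: f' is constant, so the 1/T term is the active one."""
    return 1.0, interpolation_rhs(T, 0.0, T, alpha=1.0)
```

The linear example shows why the term $1/T$ cannot be dropped: when $\partial_t f$ is constant, its
Hölder constant vanishes and only the length of the interval controls the derivative.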

6.6 Transferring properties of $\varphi$ back from the terminal time

We now prove a result which shows that if $\varphi \in S_\alpha$ then with high probability both
$\|\Pi K_{T-\delta,T}\varphi\|$ and the ratio $\|\Pi K_{T-\delta,T}\varphi\| / \|K_{T-\delta,T}\varphi\|$ cannot change dramatically
for small enough $\delta$. This allows us to step back from the terminal time $T$ to the right
end point of the time interval $I_\delta$. As mentioned at the start of this section, this is needed
to allow the rougher test functions used in Theorem 6.7.

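Lemma 6.15 Let (6.3) hold and fix any orthogonal projection $\Pi$ of $\mathcal{H}$ onto a finite
dimensional subspace of $\mathcal{H}$ spanned by elements of $\mathcal{H}_1$. Recall furthermore the relation
(6.8) between $\delta$ and $\varepsilon$. There exists a constant $c \in (0, 1)$ such that, for every $r > 0$ and
every $\alpha > 0$, the implication

\[
\varphi \in S_\alpha \quad\Longrightarrow\quad K_{T-\delta,T}\varphi \in S_{c\alpha} \quad\text{and}\quad \|\Pi K_{T-\delta,T}\varphi\| \ge \frac{\alpha}{2} \|\varphi\|\;,
\]

holds modulo a $\Psi^{\frac{n}{(1-a)r}}$-controlled family of negligible events.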

To prove this lemma, we will need the following auxiliary lemma, whose proof is
given at the end of the section.

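Lemma 6.16 For any $\delta \in (0, T/2]$, one has the bounds

\[
\mathbf{E} \sup_{\|\varphi\| \le 1} \|K_{T-\delta,T}\varphi - e^{-\delta L}\varphi\|^p \le C_p \Psi^{np}(u_0)\, \delta^{(1-a)p}\;,
\]
\[
\mathbf{E} \sup_{\|\varphi\| \le 1} \|K_{T-\delta,T}\varphi - \varphi\|_{-1}^p \le C_p \Psi^{np}(u_0)\, \delta^{(1-a)p}\;,
\]

for every $p \ge 1$ and every $u_0 \in \mathcal{H}$. Here, $n$ is the degree of the nonlinearity $N$.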

Proof of Lemma 6.15. We begin by showing that, modulo some $\Psi^{\frac{n}{(1-a)r}}$-dominated
family of negligible events,

\[
\|\Pi\varphi\| \ge \alpha\|\varphi\| \quad\Longrightarrow\quad \|\Pi K_{T-\delta,T}\varphi\| \ge \frac{\alpha}{2}\|\varphi\|\;.
\]

By the assumption on $\Pi$, we can find a collection $\{v_k\}_{k=1}^N$ in $\mathcal{H}_1$ with $\|v_k\| = 1$ such
that $\Pi\varphi = \sum_k v_k \langle v_k, \varphi \rangle$. Therefore, there exists a constant $C_1 = \sup_k \|v_k\|_1$ so that
$\|\Pi\varphi\| \le C_1 \|\varphi\|_{-1}$. Combining Lemma 6.13 with Lemma 6.16, we see that

\[
\sup_{\|\varphi\| \le 1} \|K_{T-\delta,T}\varphi - \varphi\|_{-1} \le \frac{\alpha}{2C_1}\;, \tag{6.9}
\]

modulo a $\Psi^{\frac{n}{(1-a)r}}$-dominated family of negligible events. Hence, modulo the same
family of events,

\[
\|\Pi K_{T-\delta,T}\varphi\| \ge \|\Pi\varphi\| - C_1 \|K_{T-\delta,T}\varphi - \varphi\|_{-1}
\ge \alpha\|\varphi\| - \frac{\alpha}{2}\|\varphi\| = \frac{\alpha}{2}\|\varphi\|\;.
\]

Combining now Lemma 6.13 with (6.3), we see that

\[
\|K_{T-\delta,T}\varphi\| \le \|\varphi\| + \|e^{-\delta L}\varphi\| \le C\|\varphi\|\;,
\]

modulo a $\Psi^{\frac{n}{(1-a)r}}$-dominated family of negligible events, thus showing that $K_{T-\delta,T}\varphi$
belongs to $S_{c\alpha}$ with $c = 1/(2C)$ and concluding the proof. □

We now give the proof of the auxiliary lemma used in the proof of Lemma 6.15.

Proof of Lemma 6.16. It follows from (3.13) and the variation of constants formula
that

r T 

K T - S .T<P - e~ SL (p = / e- (T - s)L DN*(u s )K SiT <p ds . 

JT-S 

It now follows from Assumption A.1, point 3, that there exists $\gamma_0 \in [0, \gamma_\star + 1)$ such
that $DN^*(u)$ is a bounded linear map from $\mathcal{H}$ to $\mathcal{H}_{-a}$ for every $u \in \mathcal{H}_{\gamma_0}$ and that its
norm is bounded by $C\|u\|_{\gamma_0}^{n-1}$ for some constant $C$. The first bound then follows by
combining Proposition 6.2 with the fact that $e^{-Lt}$ is bounded by $Ct^{-a}$ as an operator
from $\mathcal{H}_{-a}$ to $\mathcal{H}$ as a consequence of standard analytic semigroup theory [Kat80].
In order to obtain the second bound, we write

\[
\|K_{T-\delta,T}\varphi - \varphi\|_{-1} \le \|K_{T-\delta,T}\varphi - e^{-\delta L}\varphi\|_{-1} + \|e^{-\delta L}\varphi - \varphi\|_{-1}
\le \|K_{T-\delta,T}\varphi - e^{-\delta L}\varphi\| + C\delta\;,
\]

where the last inequality is again a consequence of standard analytic semigroup theory.
The claim then follows from (6.3). □

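6.7 The smallness of $\mathcal{M}_T$ implies the smallness of $\mathcal{Q}_N(u_{T-\delta})$

In this section, we show that if $\langle \mathcal{M}_T \varphi, \varphi \rangle$ is small then $\langle \mathcal{Q}_N(u_t) K_{t,T}\varphi, K_{t,T}\varphi \rangle$ must
also be small with high probability for every $t \in I_\delta$. The precise statement is given by
the following result:

Lemma 6.17 Let the Malliavin matrix $\mathcal{M}_T$ be defined as in (4.10) and assume that
Assumptions A.1 and C.1 are satisfied. Then, for every $N > 0$, there exist $r_N > 0$,
$p_N > 0$ and $q_N > 0$ such that, provided that $r \le r_N$, the implication

\[
\langle \varphi, \mathcal{M}_T \varphi \rangle \le \varepsilon\|\varphi\|^2 \quad\Longrightarrow\quad \sup_{Q \in A_N} \sup_{t \in I_\delta} |\langle K_{t,T}\varphi, Q(u_t) \rangle| \le \varepsilon^{p_N} \|\varphi\|\;,
\]

holds modulo some $\Psi^{q_N}$-dominated negligible family of events.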

Proof. The proof proceeds by induction on $N$ and the steps of this induction are the
content of the next two subsections. Since $A_1 = \{g_1, \ldots, g_d\}$, the case $N = 1$ is
implied by Lemma 6.18 below, with $p_1 = 1/4$, $q_1 = 8$, and $r_1 = 1/(8p_\beta)$.

The inductive step is then given by combining Lemmas 6.21 and 6.24 below. At
each step, the values of $p_n$ and $r_n$ decrease while $q_n$ increases, but all remain strictly
positive and finite after finitely many steps. □
6.8 The first step in the iteration

The "priming step" in the inductive proof of Lemma 6.17 follows from the fact that
the directions which are directly forced by the Wiener processes are not too small with
high probability.

Lemma 6.18 Let the Malliavin matrix $\mathcal{M}_T$ be defined as in (4.10) and assume that
Assumptions A.1 and C.1 are satisfied. Then, provided that $r \le 1/(8p_\beta)$, the implication

\[
\langle \varphi, \mathcal{M}_T \varphi \rangle \le \varepsilon\|\varphi\|^2 \quad\Longrightarrow\quad \sup_{k = 1 \ldots d}\ \sup_{t \in I_\delta} |\langle K_{t,T}\varphi, g_k \rangle| \le \varepsilon^{1/4} \|\varphi\|\;,
\]

holds modulo some $\Psi^8$-dominated negligible family of events. Here, $p_\beta$ is as in (6.3)
and $\beta$ was fixed in (6.4).


Proof. For notational compactness, we scale $\varphi$ to have norm one by replacing $\varphi$ with
$\varphi/\|\varphi\|$. We will still refer to this new unit vector as $\varphi$. Now assume that
$\langle \varphi, \mathcal{M}_T \varphi \rangle \le \varepsilon$. It then follows from (4.10) that

\[
\sup_{k = 1 \ldots d} \int_{I_\delta} \langle g_k, K_{t,T}\varphi \rangle^2 \, dt \le \varepsilon\;.
\]

Applying Lemma 6.14 with $f(t) = \int_{T/2}^t |\langle g_k, K_{s,T}\varphi \rangle|^2 \, ds$ and $\alpha = 1$, it follows that
there exists a constant $C > 0$ such that, for every $k = 1 \ldots d$, either

\[
\sup_{t \in I_\delta} |\langle g_k, K_{t,T}\varphi \rangle| \le \varepsilon^{1/4}\;,
\]

or

\[
|||\langle g_k, K_{\cdot,T}\varphi \rangle|||_1 \ge C\varepsilon^{-1/4}\;. \tag{6.10}
\]

Therefore, to complete the proof, we need only to show that the latter events form a
$\Psi^8$-dominated negligible family for every $k$. Since
$|||\langle g_k, K_{\cdot,T}\varphi \rangle|||_1 \le \|g_k\|_{-\beta}\, |||K_{\cdot,T}\varphi|||_{1,\beta}$, the bound (6.10) implies that

\[
\sup_{\varphi \in \mathcal{H} : \|\varphi\| = 1} |||K_{\cdot,T}\varphi|||_{1,\beta} \ge \frac{C\varepsilon^{-1/4}}{g^*}\;, \tag{6.11}
\]

where $g^* = \max_k \|g_k\|_{-\beta}$ (which is finite since we have by assumption that
$-\beta \le \gamma + 1 < \gamma_\star + 1$ and since $g_k \in \mathcal{H}_{\gamma_\star+1}$ for every $k$). This event depends only
on the initial condition $u_0$ and on the model under consideration. In particular, it is
independent of $\varphi$.

The claim now follows from the a priori bound (6.3b) and Lemma 6.13 with $q = \frac14$
and $b = p_\beta$. □

6.9 The iteration step 

Recall that we consider evolution equations of the type

\[
du_t = F(u_t)\, dt + \sum_{k=1}^d g_k \, dW_k(t)\;, \tag{6.12}
\]

where $F$ is a "polynomial" of degree $n$. The aim of this section is to implement
the following recursion: if, for a given polynomial $Q$, the expression $\langle Q(u_t), K_{t,T}\varphi \rangle$
is "small" in the supremum norm, then both the expression $\langle [Q, F](u_t), K_{t,T}\varphi \rangle$ and
$\langle [Q, g_k](u_t), K_{t,T}\varphi \rangle$ must be small in the supremum norm as well.

The main technical tool used in this section will be the estimates on "Wiener
polynomials" from Section 7. Using the notation

\[
W_\alpha(t) = W_{\alpha_1}(t)\, W_{\alpha_2}(t) \cdots W_{\alpha_\ell}(t)\;,
\]

for a multi-index $\alpha = (\alpha_1, \ldots, \alpha_\ell)$, this estimate states that if an expression of the type
$\sum_{|\alpha| \le m} A_\alpha(t) W_\alpha(t)$ is small, then, provided that the processes $A_\alpha$ are sufficiently
regular in time, each of the $A_\alpha$ must be small. In other words, two distinct monomials
in a Wiener polynomial cannot cancel each other out. Here, the processes $A_\alpha$ do not
have to be adapted to the filtration generated by the $W_k$, so this gives us some kind of
anticipative replacement of Norris' lemma. The main trick that we use in order to take
advantage of such a result is to switch back and forth between considering the process
$u_t$ solution to (6.12) and the process $v_t$ defined by

\[
v_t = u_t - \sum_{k=1}^d g_k W_k(t)\;,
\]

which has more time-regularity than $u_t$. Recall furthermore that given a polynomial $Q$
and a multi-index $\alpha$, we denote by $Q_\alpha$ the corresponding term (3.5) appearing in the
(finite) Taylor expansion of $Q$.
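
To make the switch between $u_t$ and $v_t$ concrete, here is a one-dimensional caricature
with a single noise direction ($d = 1$): for a scalar polynomial $Q$ and $u = v + g\,w$, the
finite Taylor expansion reads $Q(u) = \sum_k A_k\, w^k$ with $A_k = Q^{(k)}(v)\, g^k / k!$. The sketch
below is an illustration under these simplifying assumptions only. The normalisation of
$Q_\alpha$ in (3.4) and (3.5) is not reproduced here, and all function names are hypothetical.

```python
from math import factorial

def poly_eval(coeffs, x):
    """Evaluate sum_j coeffs[j] * x**j using Horner's scheme."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * x + c
    return result

def poly_derivative(coeffs):
    """Coefficient list of the derivative polynomial."""
    return [j * c for j, c in enumerate(coeffs)][1:]

def taylor_terms(coeffs, v, g):
    """Coefficients A_k = Q^(k)(v) * g**k / k! such that
    Q(v + g*w) = sum_k A_k * w**k (a finite sum, since Q is a polynomial)."""
    terms, current, k = [], list(coeffs), 0
    while current:
        terms.append(poly_eval(current, v) * g ** k / factorial(k))
        current = poly_derivative(current)
        k += 1
    return terms
```

The coefficients $A_k$ depend only on $v$, which is the scalar analogue of the fact that the
coefficients $\langle Q_\alpha(v_t), K_{t,T}\varphi \rangle$ of the Wiener polynomial are more regular in time
than $u_t$ itself.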

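Recall the definition $\mathrm{Poly}^m(\gamma, \beta) = \mathrm{Poly}^m(\mathcal{H}_\gamma, \mathcal{H}_{-\beta-1}) \cap \mathrm{Poly}^m(\mathcal{H}_{\gamma+1}, \mathcal{H}_{-\beta})$.
We first show that if $Q \in \mathrm{Poly}^m(\gamma, \beta)$ and $\langle Q(u_t), K_{t,T}\varphi \rangle$ is small, then the expression
$\langle Q_\alpha(v_t), K_{t,T}\varphi \rangle$ (note the appearance of $v_t$ rather than $u_t$) must be small as well for
every multi-index $\alpha$: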

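Lemma 6.19 Let $Q \in \mathrm{Poly}^m(\gamma, \beta)$ for some $m \ge 0$ and for $\gamma$ and $\beta$ as chosen in
(6.4). Let furthermore $q > 0$ and set $\bar q = q\,3^{-m}$. Then, the implication

\[
\sup_{t \in I_\delta} |\langle Q(u_t), K_{t,T}\varphi \rangle| \le \varepsilon^{q} \|\varphi\| \quad\Longrightarrow\quad \sup_\alpha \sup_{t \in I_\delta} |\langle Q_\alpha(v_t), K_{t,T}\varphi \rangle| \le \varepsilon^{\bar q} \|\varphi\|\;,
\]

holds modulo some $\Psi^{6(m+1)/\bar q}$-dominated negligible family of events, provided that
$r < \bar q/(6p_\beta)$.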

Proof. Note first that both inner products appearing in the implication are well-defined 
by Proposition 16.21 and the assumptions on Q. By homogeneity, we can assume that 
\\<p\\ = 1. Since Q is a polynomial, ( 13.41 implies that 

(QiiH),K t , T tp) = J2{Q a iv t ),K t! Tv)W a it) . 

a 

Applying Theorem 17. II we see that, modulo some negligible family of events Osc^>, 
sup teJ(5 I (Qiu t ), Kt,T<f) I < e 9 implies that either 

sup sup \(Q a iv t ),K t . T ip)\ < e q , (6.13) 

a tels 

or there exists some a such that 

UQa(.vt),K., T (p)^>e-^ 3 . (6.14) 
We begin by arguing that the second event is negligible. Since $Q$ is of degree $m$, there
exists a constant $C$ such that

||(Q a Ot),K T <£)|i < SUp \\Kt,T<p\\p+ll\Q a (v.)\ll-p-l + SUp ||(5 Q (Vt)||_ j 3|||ii'. j T< ) £'|||l > /3 

teis tei s 

< C sup \\K t ,T<p\\p+i sup INI^IHi/y + C sup |H|™ +1 |||K r^||i,/3 ■ 
tei s tei s tei s 

Here, we used the fact that Q a G Poly'"(7i 7 , W-i-^) to bound the first term and the 
fact that Q a G Poly m (7i 7+ i , to bound the second term. The fact that Q a belongs 
to these spaces is a consequence of G W 7l +i and of the definition ( 13.41 of Q Q . 
Therefore, ( 16.141 ) implies that either 

X 5 = sup suplli^T^II/mSuplhr'lMlk-y > ^ e " ?73 ( 6 - 15 ) 



7 lll"IIIJ-,7 — 9/"^ 
¥>GH: || v || = l*e/a tgl 4 zo 



y 5 = sup sup IMI^JKt^II!^ > -^e- ?/3 . (6.16) 
ipen-.\\ip\\=iteis ^ 



or 



Combining the Cauchy-Schwarz inequality with Proposition 16.21 we see that X$ and 
Ys satisfy the assumptions of Lemma lS. 131 with $ = <J/" 1+1 and b = pp, thus showing 
that the families of events ( 16.151 ) and ( 16.16b are both ^ 6(m+1) /^-dominated negligible, 
provided that r < q/(Gpp). □ 

In the sequel, we will need the following simple result which is, in some way, a
converse to Theorem 7.1.



Lemma 6.20 Given any integer N > and any two exponents < q < q, there exists 
a universal family of negligible events Sup^ such that the implication 

sup sup \A a (t)\ < e q sup ^ A a (t)W a (t) < e q 

a teI * teIsl a -.\ a \<N 

holds modulo Sup^r for any collection of processes {A a (f) : \a\ < N}. 
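Proof. Observe that

\[
\sup_{t \in I_\delta} \Bigl| \sum_{\alpha : |\alpha| \le N} A_\alpha(t) W_\alpha(t) \Bigr| \le \Bigl( \sup_\alpha \sup_{t \in I_\delta} |A_\alpha(t)| \Bigr) \Bigl( \sum_{\alpha : |\alpha| \le N} \sup_{t \in I_\delta} |W_\alpha(t)| \Bigr)\;.
\]

Since for any $p > 0$,

\[
\Bigl\{ \sum_{\alpha : |\alpha| \le N} \sup_{t \in I_\delta} |W_\alpha(t)| > \varepsilon^{-p} \Bigr\}
\]

is a negligible family of events, the claim follows at once. □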
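
The inequality used in this proof is purely pathwise, so it can be checked on simulated
paths. The following sketch draws discretised Wiener paths, forms all monomials $W_\alpha$ up
to a given degree, and compares both sides for arbitrary (here deterministic, hence
trivially non-adapted) coefficient paths. The discretisation, the function names and the
choice of coefficients are our own assumptions for illustration.

```python
import itertools
import math
import random

def brownian_paths(d, n_steps, dt, rng):
    """d independent discretised Wiener paths on a grid of n_steps + 1 points."""
    paths = []
    for _ in range(d):
        w, path = 0.0, [0.0]
        for _ in range(n_steps):
            w += rng.gauss(0.0, math.sqrt(dt))
            path.append(w)
        paths.append(path)
    return paths

def wiener_monomials(paths, max_degree):
    """Map each multi-index alpha (a tuple of indices, length <= max_degree)
    to the path of W_alpha = product of W_{alpha_i}; the empty tuple gives 1."""
    d, n = len(paths), len(paths[0])
    monomials = {}
    for degree in range(max_degree + 1):
        for alpha in itertools.product(range(d), repeat=degree):
            monomials[alpha] = [math.prod(paths[k][i] for k in alpha) for i in range(n)]
    return monomials

def both_sides(coeffs, monomials):
    """Return (sup_t |sum_alpha A_alpha W_alpha|, sup|A| * sum_alpha sup_t |W_alpha|)."""
    n = len(next(iter(monomials.values())))
    lhs = max(abs(sum(coeffs[a][i] * monomials[a][i] for a in monomials)) for i in range(n))
    sup_A = max(max(abs(x) for x in coeffs[a]) for a in monomials)
    sum_sup_W = sum(max(abs(x) for x in monomials[a]) for a in monomials)
    return lhs, sup_A * sum_sup_W
```

Since the bound follows from the triangle inequality alone, it holds for every sample
path; the probabilistic content of Lemma 6.20 lies entirely in the tail estimate for
$\sum_\alpha \sup_t |W_\alpha(t)|$.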

As a corollary to Lemmas 6.19 and 6.20 we now obtain the key estimate for
Lemma 6.17 in the particular case where the commutator is taken with one of the
constant vector fields:
Lemma 6.21 Let $Q \in \mathrm{Poly}^m(\gamma, \beta)$ be a polynomial of degree $m$ and let $q > 0$. Then,
for $\bar q = q\,3^{-(m+1)}$, the implication

\[
\sup_{t \in I_\delta} |\langle Q(u_t), K_{t,T}\varphi \rangle| \le \varepsilon^{q} \|\varphi\| \quad\Longrightarrow\quad \sup_\alpha \sup_{t \in I_\delta} |\langle Q_\alpha(u_t), K_{t,T}\varphi \rangle| \le \varepsilon^{\bar q} \|\varphi\|\;,
\]

holds for all $\varphi \in \mathcal{H}$ modulo some $\Psi^{2(m+1)/\bar q}$-dominated negligible family of events,
provided that $r < \bar q/(2p_\beta)$.

Proof. Since it follows from (3.4) that $(Q_\alpha)_\beta = Q_{\alpha \cup \beta}$, we have the identity

\[
Q_\alpha(u_t) = \sum_\beta (Q_\alpha)_\beta(v_t)\, W_\beta = \sum_\beta Q_{\alpha \cup \beta}(v_t)\, W_\beta\;.
\]

Combining Lemma 6.19 and Lemma 6.20 with $N = m$ proves the claim. □

In the next step, we show a similar result for the commutators between $Q$ and $F$.
We are going to use the fact that if a function $f$ is differentiable with Hölder continuous
derivative, then $f$ being small implies that $\partial_t f$ is small as well, as made precise by
Lemma 6.14. As previously, we start by showing a result that involves the process $v_t$
instead of $u_t$:

Lemma 6.22 Let Q be as in Lemma \6.19\ and such that [Q a ,F a ] G Poly(7, (J) for 
any two multi-indices a, a. Let furthermore q > and set q = q3~ 2m /8. Then the 
implication 

sup \{Q(u t ),Kt,T<p)\ < £ 9 IMI =^ sup sup \{[Q a ,F a ](v t ),K t ^)\ < e 9 \\tp\\ , 

holds modulo some \]> 6 (" l + 1 */ '9 -dominated negligible family of events, provided that 
r < q/ifipp). (As before the empty multi-indices are included in the supremum.) 

Proof. By homogeneity, we can assume that $\|\varphi\| = 1$. Combining Lemma 6.19 with
Lemma 6.14 and defining $\hat q = q\,3^{-m}$, we obtain that $\sup_{t \in I_\delta} |\langle Q(u_t), K_{t,T}\varphi\rangle| \le \varepsilon^{q}$
implies for $f_{\alpha,\varphi}(t) = \partial_t \langle Q_\alpha(v_t), K_{t,T}\varphi\rangle$ the bound

sup \f a , v (t)\ < Cmaxie^eilUjffi , (6.17) 
teh 

modulo some $\Psi^{6(m+1)/\bar q}$-dominated negligible family of events, provided that $r <
\bar q/(6\bar p_\beta)$. Note that this family is in particular independent of both $\alpha$ and $\varphi$. Here and
in the sequel, we use the letter $C$ to denote a generic constant depending on the details
of the problem that may change from one expression to the next.

One can see that $\langle Q_\alpha(v_t), K_{t,T}\varphi\rangle$ is differentiable in $t$ by combining
Proposition 6.2 with the fact that $Q_\alpha \in \mathrm{Poly}(\mathcal{H}_\gamma, \mathcal{H}_{-1-\beta}) \cap \mathrm{Poly}(\mathcal{H}_{\gamma+1}, \mathcal{H}_{-\beta})$ as in the proof
of Lemma 6.19. See [DL92] for a more detailed proof of a similar statement.

Computing the derivative explicitly, we obtain

$$f_{\alpha,\varphi}(t) = \langle DQ_\alpha(v_t) F(u_t) - DF(u_t) Q_\alpha(v_t), K_{t,T}\varphi\rangle = \langle B_\alpha(t), K_{t,T}\varphi\rangle .$$

The function $B_\alpha$ can be further expanded to

$$B_\alpha(t) = \sum_{\sigma} \bigl( DQ_\alpha(v_t) F_\sigma(v_t) - DF_\sigma(v_t) Q_\alpha(v_t) \bigr) W_\sigma(t) = \sum_{\sigma} [Q_\alpha, F_\sigma](v_t)\, W_\sigma(t) .$$

Notice that, by the assumption that $[Q_\alpha, F_\sigma] \in \mathrm{Poly}(\gamma, \beta)$, one has

UQcFrXv.WMh -l-f3 < C(l + SUp \\v t \\ 7 r +m - 2 -W-W\\d t V t \U SUp \W a (t)\ 

+ c\\w a \liO + sup iM^r+^-w-M , 

||[g Q ,F a ]( Wt )^W||^<C7(l + ||^|| 7 +i) n+m - 1 ' |Q| ' |,T| |^ (T (t)| . 

(Here it is understood that if one of the exponents of the norm of $v_t$ is negative, the
term in question actually vanishes.) It therefore follows from Proposition 6.2 that

n\B a i\ _,_ 3 < C p ¥ n+m - l Hu ) , E sup \\B a (t)\\ p _ < C p ¥ n+m -^(u a ) , 
s ' p tei s 

for every p > 1 and some constants C p . 

Since the Holder norm of f a>tp is bounded by 

\\(B a (.),K T <p)\h 3 < Mi sup \\K t , T \\ 0+1 + l\K., T l\ hf3 sup \\B a (t)\\-f, , 

3 3 teh 3 teh 

we can use the bounds on B a just obtained, the Cauchy-Schwarz inequality, Proposi- 
tion [5721 and Lemma 15. 131 to obtain 

sup sup 1/o j¥ J?/3 < , (6.18) 

modulo some \l' 12(n+m) / < ?- C iominated negligible family of events, provided that r < 
min{g/12, q/{&pp)}. As a consequence, modulo this family, we obtain from ( 16.17b the 
bound sup a sup tgJj \ f a ,tp(t)\ < Cei which can be rewritten as 



sup sup 



^2([Q a ,F a ](v t ),K t ^)WAt) 



< Ce* . (6.19) 



Since $[Q_\alpha, F_\sigma] \in \mathrm{Poly}(\gamma, \beta)$, the same reasoning as in Lemma 6.19 combined with
Theorem 7.1 on Wiener polynomials implies that, modulo some negligible family of
events $\mathrm{Osc}_W^m$, the estimate (6.19) implies that either

$$\sup_{\alpha,\sigma} \sup_{t \in I_\delta} |\langle [Q_\alpha, F_\sigma](v_t), K_{t,T}\varphi\rangle| \le \varepsilon^{\bar q} , \tag{6.20}$$

or there exists some $\alpha$ and $\sigma$ such that

\m^F a \vt\K.^)\x > . (6.21) 

Again following the same logic as Lemma 6.19, we see that the family of events in
(6.21) is $\Psi^{6(m+1)/\bar q}$-dominated negligible provided that $r < \bar q/(6\bar p_\beta)$. □

In order to turn this result into a result involving the process $u_t$, we need the
following expansion:

Lemma 6.23 Given any two multi-indices $\alpha$ and $\sigma$ (including the empty indices), there
exist an integer $N$ and a collection of multi-indices $\{\alpha_i, \sigma_i, \zeta_i : i = 1, \ldots, N\}$ and
constants $\{c_i : i = 1, \ldots, N\}$ so that

$$[Q_\alpha, F_\sigma](u_t) = \sum_{i=1}^{N} c_i\, [Q_{\alpha_i}, F_{\sigma_i}](v_t)\, W_{\zeta_i}(t) .$$






Proof. First observe that 

c 

The Jacobi identity for Lie brackets states that

$$D_{g_k}[Q_\alpha, F_\sigma] = [g_k, [Q_\alpha, F_\sigma]] = [[g_k, Q_\alpha], F_\sigma] + [Q_\alpha, [g_k, F_\sigma]]
= (|\alpha| + 1)\,[Q_{\alpha\cup(k)}, F_\sigma] + (|\sigma| + 1)\,[Q_\alpha, F_{\sigma\cup(k)}] .$$

By iterating this calculation, we see that for any multi-index $\zeta$, $[Q_\alpha, F_\sigma]_\zeta$ is equal to
some linear combination of a finite number of terms of the form $[Q_{\alpha_i}, F_{\sigma_i}]$ for some
multi-indices $\alpha_i$ and $\sigma_i$. □
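
The Jacobi identity used in this proof can be checked by hand on a toy case. The following is a minimal sketch, not part of the argument: it assumes linear vector fields $x \mapsto Ax$ on $\mathbf{R}^3$, for which the Lie bracket reduces to the matrix commutator, and the matrices `G`, `Q`, `F` are arbitrary illustrative choices.

```python
# Linear vector fields x -> A x: with the convention [F, G] = DF G - DG F, the
# bracket of x -> A x and x -> B x is x -> (A B - B A) x, so Lie brackets of
# such fields reduce to matrix commutators (integer arithmetic, hence exact).

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def combine(a, b, sign):
    """Entrywise a + sign * b."""
    return [[x + sign * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def bracket(a, b):
    return combine(matmul(a, b), matmul(b, a), -1)

def jacobi_defect(g, q, f):
    """[g,[q,f]] - [[g,q],f] - [q,[g,f]]; the zero matrix if Jacobi holds."""
    lhs = bracket(g, bracket(q, f))
    rhs = combine(bracket(bracket(g, q), f), bracket(q, bracket(g, f)), +1)
    return combine(lhs, rhs, -1)

# Hypothetical test matrices, chosen only for illustration.
G = [[0, 1, 0], [2, 0, -1], [1, 1, 3]]
Q = [[1, 0, 2], [0, -1, 1], [4, 0, 0]]
F = [[2, -3, 0], [1, 1, 1], [0, 5, -2]]
DEFECT = jacobi_defect(G, Q, F)
print(DEFECT)
```

The identity is insensitive to the sign convention chosen for the bracket, which is why the commutator can stand in for it here.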

In very much the same way as before, it then follows that: 

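Corollary 6.24 Let $Q$ be as in Lemma 6.19 and such that $[Q_\alpha, F_\sigma] \in \mathrm{Poly}(\gamma, \beta)$ for
any two multi-indices $\alpha$, $\sigma$. Let furthermore $q > 0$ and set $\bar q = q\,3^{-2(m+1)}/8$. Then the
implication

$$\sup_{t \in I_\delta} |\langle Q(u_t), K_{t,T}\varphi\rangle| \le \varepsilon^{q} \|\varphi\| \quad\Longrightarrow\quad \sup_{\alpha,\sigma} \sup_{t \in I_\delta} |\langle [Q_\alpha, F_\sigma](u_t), K_{t,T}\varphi\rangle| \le \varepsilon^{\bar q} \|\varphi\| ,$$

holds modulo some $\Psi^{2(m+1)/\bar q}$-dominated negligible family of events, provided that
$r < 3\bar q/(2\bar p_\beta)$.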

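Proof. It follows from Lemma 6.23 that

$$\langle [Q_\alpha, F_\sigma](u_t), K_{t,T}\varphi\rangle = \sum_{i=1}^{N} c_i\, \langle [Q_{\alpha_i}, F_{\sigma_i}](v_t), K_{t,T}\varphi\rangle\, W_{\zeta_i}(t) .$$

Combining the control of the $\langle [Q_{\alpha_i}, F_{\sigma_i}](v_t), K_{t,T}\varphi\rangle$ obtained in Lemma 6.22 with
Lemma 6.20 gives the quoted result. □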

6.10 Putting it all together: proof of Theorem 6.12

We now finally combine all of the results we have just accumulated to give the proof
of the main theorem of these sections.

Proof of Theorem 6.12. We are going to prove the statement by showing that there
exists $\theta > 0$ and, for every $\alpha > 0$, a $\Psi^\theta$-dominated family of negligible events such that,
modulo this family, the assumption $\inf_{\varphi \in S_\alpha} \langle \varphi, \mathcal{M}_T \varphi\rangle \le \varepsilon \|\varphi\|^2$ leads to a
contradiction for all $\varepsilon$ sufficiently small.

From now on, fix $N$ as in Assumption C.2. By Lemmas 6.15 and 6.17, we see
that there exist constants $\theta, q, r_0 > 0$ such that, modulo some $\Psi^\theta$-dominated family of
negligible events, one has the implication

f G S a 1 j K T -s,t<P 6 S ca and ||LLffT- ( 5,T</'|| > f |M| 

(ip, Mtv) < e|MI 2 J \ (Kt-s,t<P, Qn(ut-s)Kt-s,tv) < £ 9 IM| 2 , 

provided that we choose $r < r_0$ in the definition (6.8) of $\delta$. By Assumption C.2 this in
turn implies (modulo the same family of negligible events)

Ct _ -i q 

■■■ g-Hvll < \\nK T - S ,T<p\\ < A^VcO^IMI ■ 
On the other hand, it follows from Lemma 6.13 and the assumption on the inverse

4 

moments of A CQ that, modulo some « -dominated family of negligible events, one 
has the bound 



Possibly making $\theta$ smaller, it follows that, modulo some $\Psi^\theta$-dominated family of
negligible events, one has the implication



which cannot hold for $\varepsilon$ small enough, thus concluding the proof of Theorem 6.12. □

7 Bounds on Wiener polynomials 

We will use the terminology of "negligible sets" introduced in Definition 6.9. We will
always work on the time interval [0, 1], but all the results are independent (modulo 
change of constants) of the time interval, provided that its length is bounded from 
above and from below by two positive constants independent of e. This is seen easily 
from the scaling properties of the Wiener process. 

The results of this section are descendants of similar results obtained in [MP06,
BM07] by related techniques. In [BM07] it was proven that if a Wiener polynomial,
with continuous, bounded variation coefficients, is identically zero on an interval then
so are its coefficients. This is enough to prove the almost sure invertibility of
projections of the Malliavin matrix, which in turn implies the existence of a density for the
projections of the transition probabilities. To prove smoothness of the densities or the
ergodic results of this paper, more quantitative control is needed. In [BM07], a result
close to (7.1) is claimed. However, an error in Lemma 9.12 of that article leaves the
proof incomplete. Arguing along similar, though slightly different lines, we prove the
needed result below. We build upon the presentation in [BM07] but simplify it
significantly. (The presentation in [BM07] was already a significant simplification over that
in [MP06].)

Theorem 7.1 Let $\{W_k\}_{k=1}^d$ be a family of i.i.d. standard Wiener processes and, for
every multi-index $\alpha = (\alpha_1, \ldots, \alpha_\ell)$, define $W_\alpha = W_{\alpha_1} \cdots W_{\alpha_\ell}$ with the convention
that $W_\alpha = 1$ if $\alpha = \emptyset$. Let furthermore $A_\alpha$ be a family of (not necessarily adapted)
stochastic processes with the property that there exists $m \ge 0$ such that $A_\alpha = 0$
whenever $|\alpha| > m$ and set $Z_A(t) = \sum_{\alpha} A_\alpha(t) W_\alpha(t)$.

Then, there exists a universal family of negligible events $\mathrm{Osc}_W^m$ depending only on
$m$ such that the implication

$$\|Z_A\|_{L^\infty} < \varepsilon \quad\Longrightarrow\quad \begin{cases} \text{either } \sup_\alpha \|A_\alpha\|_{L^\infty} < \varepsilon^{3^{-m}} , \\ \text{or } \sup_\alpha \|A_\alpha\|_{\mathrm{Lip}} > \varepsilon^{-3^{-(m+1)}} , \end{cases} \tag{7.1}$$

holds modulo $\mathrm{Osc}_W^m$. (The supremum norms are taken on the interval $[0, 1]$.)

Remark 7.2 Informally, we can read the statement of Theorem 7.1 as "if $Z_A$ is small,
then either all of the coefficients $A_\alpha$ are small, or at least one of them oscillates very
fast." The exponents appearing in the statement of Theorem 7.1 are somewhat arbitrary.
By going through the proof more carefully, we can see that for any $\kappa > 2$, it is possible
to find a constant $C_\kappa > 0$ such that the exponents in (7.1) can be replaced by $\kappa^{-m}$
and $-C_\kappa \kappa^{-m}$ respectively. Here, the coefficient $C_\kappa$ tends to $0$ as $\kappa \to 2$. While
the precise values of the exponents in (7.1) arising from our proof are unlikely to be
sharp, they are not far from it, as can be seen by looking at processes of the form
$Z(t) = \varepsilon^{1-\theta/2} (W_\varepsilon(t) - W(t))$, where $W_\varepsilon$ is the linear interpolation of the Wiener process
$W$ over intervals of size $\varepsilon^{\theta}$.

Remark 7.3 The reason why the family of negligible sets appearing in this statement
is called $\mathrm{Osc}_W^m$ is that it relies on the fact that the Wiener processes typically fluctuate
sufficiently fast on every small time interval so that their effects can be distinguished
from those of the multipliers $A_\alpha$ which fluctuate over much longer timescales. It is
important to note that $\mathrm{Osc}_W^m$ depends on the processes $A_\alpha$ only through the value of $m$.

Before we start with the proof, we show the following result, which is essentially
the particular case of Theorem 7.1 where $m = 1$ and where the coefficients $A_\alpha$ do not
depend on time. Here, $\langle \cdot, \cdot \rangle$ denotes the scalar product in $\mathbf{R}^d$.

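Lemma 7.4 Let $\{W_k\}_{k=1}^d$ be a collection of i.i.d. standard Wiener processes. Then,
for any exponent $\kappa > 0$, there exists a universal family $\mathrm{Osc}_W$ of negligible events such
that the bound

$$\sup_{t \in [0,1]} |\langle A, W(t)\rangle| \ge \varepsilon^{\kappa} |A| , \tag{7.2}$$

holds modulo $\mathrm{Osc}_W$ for any choice of coefficients $A \in \mathbf{R}^d$.

Remark 7.5 We would like to stress again the fact that the family of events $\mathrm{Osc}_W$ is
independent of the choice of coefficients $A$ and depends only on the realisation of the
$W_k$'s.

Proof. Fix $\kappa > 0$ and define a family of events $B$ by $B_\varepsilon = \{\sup_{t \in [0,1]} |W(t)| >
\varepsilon^{-\kappa}\}$. It follows immediately from the fact that the supremum of a Wiener process has
Gaussian tails that the family $B$ is negligible. Consider now the unit sphere $S^d$ in $\mathbf{R}^d$.
For every $A \in S^d$, the process $W_A(t) = \langle A, W(t)\rangle$ is a standard Wiener process and
so $P(\sup_{t \in [0,1]} |W_A(t)| \le 2\varepsilon^{\kappa}) \le C_1 \exp(-C_2 \varepsilon^{-2\kappa})$ for some constants $C_1$ and $C_2$
that are independent of $A$. Denote this event by $H_\varepsilon^A$.

Choose now a collection $\{A_k\}$ of points in $S^d$ such that $\sup_{A \in S^d} \inf_k |A - A_k| \le
\varepsilon^{2\kappa}$ and define $H_\varepsilon = \bigcup_k H_\varepsilon^{A_k}$. Since this can be achieved with $O(\varepsilon^{-2\kappa(d-1)})$ points,
the family $H$ is negligible by Lemma 6.10. We now define $\mathrm{Osc}_W = H \cup B$ and we
note that, modulo $\mathrm{Osc}_W$, one has for every $A \in \mathbf{R}^d$ the bound

$$\sup_{t \in [0,1]} |\langle A, W(t)\rangle| \ge |A| \inf_{\bar A \in S^d} \sup_{t \in [0,1]} |\langle \bar A, W(t)\rangle|
\ge |A| \Bigl( \inf_k \sup_{t \in [0,1]} |\langle A_k, W(t)\rangle| - \varepsilon^{2\kappa} \sup_{t \in [0,1]} |W(t)| \Bigr)
\ge |A| \bigl( 2\varepsilon^{\kappa} - \varepsilon^{\kappa} \bigr) = \varepsilon^{\kappa} |A| ,$$

as required. Here, the second inequality uses the fact that every point of the sphere lies
within distance $\varepsilon^{2\kappa}$ of some $A_k$, and the third one uses the definitions of $H$ and $B$.

□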
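
The covering argument in this proof can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it uses a discretised two-dimensional Brownian path (random walk with Gaussian steps), a uniform net of directions on the unit circle, and hypothetical helper names (`brownian_path`, `sup_projection`, `min_sup_over_circle`). It computes the smallest value of $\sup_t |\langle A, W(t)\rangle|$ over the net, which is the quantity that the lemma bounds from below by $\varepsilon^\kappa$ outside of a negligible event.

```python
import math
import random

def brownian_path(d, n_steps, rng):
    """Discretised d-dimensional Brownian path on [0, 1] (Gaussian increments)."""
    dt = 1.0 / n_steps
    w = [0.0] * d
    path = [tuple(w)]
    for _ in range(n_steps):
        w = [x + rng.gauss(0.0, math.sqrt(dt)) for x in w]
        path.append(tuple(w))
    return path

def sup_projection(path, a):
    """sup_t |<a, W(t)>| along the discretised path."""
    return max(abs(sum(ai * wi for ai, wi in zip(a, w))) for w in path)

def min_sup_over_circle(path, n_dirs):
    """Minimum of sup_t |<A_k, W(t)>| over a uniform net {A_k} of the unit circle."""
    net = [(math.cos(2 * math.pi * k / n_dirs), math.sin(2 * math.pi * k / n_dirs))
           for k in range(n_dirs)]
    return min(sup_projection(path, a) for a in net)

rng = random.Random(0)                 # fixed seed: the run is reproducible
PATH = brownian_path(2, 2000, rng)
WORST = min_sup_over_circle(PATH, 90)  # plays the role of inf_k sup_t |<A_k, W(t)>|
print(WORST)
```

By homogeneity, any vector $A$ pointing in a direction of the net then satisfies $\sup_t |\langle A, W(t)\rangle| \ge |A| \cdot$ `WORST` on this particular path.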



We now turn to the

Proof of Theorem 7.1. The proof proceeds by induction on the parameter $m$. For $m =
0$, the statement is trivial since in this case one has $Z_A(t) = A_\emptyset(t)$, so that one can take
$\mathrm{Osc}_W^0 = \emptyset$.

Fix now a value $m \ge 1$ and assume that, for some $\varepsilon$, both inequalities

$$\|Z_A\|_{L^\infty} \le \varepsilon , \tag{7.3a}$$

$$\sup_{|\alpha| \le m} \|A_\alpha\|_{\mathrm{Lip}} \le \varepsilon^{-3^{-(m+1)}} , \tag{7.3b}$$

hold. Our aim is to find a (universal) family of negligible sets $\mathrm{Osc}_W^m$ such that, modulo
$\mathrm{Osc}_W^m$, these two bounds imply the bound $\sup_\alpha \|A_\alpha\|_{L^\infty} \le \varepsilon^{3^{-m}}$. Before we proceed,
we localise our argument to Wiener processes that do not behave too "wildly." Using
the fact that the Hölder norm of a Wiener process has Gaussian tails for every Hölder
exponent smaller than $1/2$, we see that the bounds

$$\sup_{t \in [0,1]} \sup_{|\alpha| \le m} |W_\alpha(t)| \le \varepsilon^{-1/10} , \qquad \sup_{s \ne t} \sup_{|\alpha| \le m} \frac{|W_\alpha(t) - W_\alpha(s)|}{|t - s|^{2/5}} \le \varepsilon^{-1/30} , \tag{7.4}$$

both hold modulo some universal family Wien of negligible events. The reason for
these particular choices of exponents will become clearer later on, but any two negative
exponents would have been admissible.

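Choose an exponent $\kappa$ to be determined later and define a sequence of times $t_\ell =
\ell \varepsilon^{\kappa}$ for $\ell = 0, \ldots, \varepsilon^{-\kappa}$, so that the interval $[0, 1]$ gets divided into $\varepsilon^{-\kappa}$ subintervals of
the form $[t_\ell, t_{\ell+1}]$. We define $A_\alpha^\ell = A_\alpha(t_\ell)$ and similarly for $W_\alpha^\ell$. We also define the
Wiener increments $W_j^\ell(t) = W_j(t) - W_j(t_\ell)$ and their products $W_\alpha^\ell(t) = \prod_{j \in \alpha} W_j^\ell(t)$. With
these notations, one has for $t \in [t_\ell, t_{\ell+1}]$ the equality

$$\begin{aligned}
Z_A(t) &= Z_A(t_\ell) + \sum_{\alpha \ne \emptyset} A_\alpha^\ell \bigl( W_\alpha(t) - W_\alpha^\ell \bigr) + \sum_{\alpha} \bigl( A_\alpha(t) - A_\alpha^\ell \bigr) W_\alpha(t) \\
&= Z_A(t_\ell) + \sum_{\alpha \ne \emptyset} \sum_{\emptyset \ne \sigma \subset \alpha} A_\alpha^\ell\, W_{\alpha \setminus \sigma}^\ell\, W_\sigma^\ell(t) + \sum_{\alpha} \bigl( A_\alpha(t) - A_\alpha^\ell \bigr) W_\alpha(t) \\
&= Z_A(t_\ell) + \sum_{\alpha} \sum_{\sigma \ne \emptyset} C_{\alpha,\sigma}\, A_{\alpha \cup \sigma}^\ell\, W_\alpha^\ell\, W_\sigma^\ell(t) + \sum_{\alpha} \bigl( A_\alpha(t) - A_\alpha^\ell \bigr) W_\alpha(t) \\
&= Z_A(t_\ell) + \sum_{\alpha} \sum_{j=1}^{d} C_{\alpha,(j)}\, A_{\alpha \cup (j)}^\ell\, W_\alpha^\ell\, W_j^\ell(t) + E_\ell(t) ,
\end{aligned} \tag{7.5}$$

for some "error term" $E_\ell$ that will be analysed later. Here, the combinatorial factor
$C_{\alpha,\sigma}$ counts the number of ways in which the multi-index $\sigma$ can appear in the
multi-index $\alpha \cup \sigma$ (for example $C_{(i,j),(j)}$ is equal to $2$ if $i \ne j$ and $3$ if $i = j$). Using the
Brownian scaling and the fact that the supremum of a Wiener process has Gaussian
tails, we see that for every $\kappa' < \kappa$, the bound

$$\sup_{\ell \le \varepsilon^{-\kappa}} \sup_{t \in [t_\ell, t_{\ell+1}]} \sup_{j \in \{1, \ldots, d\}} |W_j^\ell(t)| \le \varepsilon^{\kappa'/2} , \tag{7.6}$$

holds modulo some universal family $\mathrm{Wien}_{\kappa'}^m$ of negligible events.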
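
The algebra behind the expansion of $W_\alpha(t) - W_\alpha(t_\ell)$ into products of increments is elementary but easy to get wrong, so a small exact check may help. The sketch below is an illustration only: the function names, the multi-index `ALPHA` and the numerical values are hypothetical choices, and exact rational arithmetic is used so that the identity can be tested with equality.

```python
from fractions import Fraction
from itertools import combinations
from math import prod

def increment_expansion(alpha, w_left, dw):
    """Sum over non-empty sets of positions of alpha whose factor is replaced by
    the increment dw[j]; the remaining factors stay frozen at w_left[j]."""
    positions = range(len(alpha))
    total = Fraction(0)
    for size in range(1, len(alpha) + 1):
        for moved in combinations(positions, size):
            frozen = prod(w_left[alpha[i]] for i in positions if i not in moved)
            moving = prod(dw[alpha[i]] for i in moved)
            total += frozen * moving
    return total

def direct_increment(alpha, w_left, dw):
    """W_alpha(t) - W_alpha(t_l), computed directly from W_j(t) = W_j(t_l) + dW_j."""
    return prod(w_left[j] + dw[j] for j in alpha) - prod(w_left[j] for j in alpha)

# Hypothetical values of W_j(t_l) and of the increments W_j(t) - W_j(t_l).
W_LEFT = {0: Fraction(1, 2), 1: Fraction(-5, 4)}
DW = {0: Fraction(1, 8), 1: Fraction(1, 4)}
ALPHA = (0, 1, 1)

print(increment_expansion(ALPHA, W_LEFT, DW), direct_increment(ALPHA, W_LEFT, DW))
```

For `ALPHA = (0, 1, 1)` the increment of index `1` can be taken from two different positions, which is the combinatorial factor $2$ quoted for $C_{(i,j),(j)}$ with $i \ne j$.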

Note now that all the terms appearing in $E_\ell$ are (up to combinatorial factors) either
of the form $A_{\alpha \cup \sigma}^\ell W_\alpha^\ell W_\sigma^\ell(t)$ with $|\sigma| \ge 2$, or of the form $(A_\alpha(t) - A_\alpha^\ell) W_\alpha(t)$.
Together with (7.6) and the first bound in (7.4), this shows that there exists a constant $C$
depending only on $m$ such that (7.3b) implies

$$\sup_{\ell \le \varepsilon^{-\kappa}} \sup_{t \in [t_\ell, t_{\ell+1}]} |E_\ell(t)| \le C \bigl( \varepsilon^{\kappa' - 1/27 - 1/10} + \varepsilon^{\kappa - 1/9 - 1/10} \bigr) , \tag{7.7}$$

modulo $\mathrm{Wien}_{\kappa'}^m$. Here we used the fact that (7.3b) implies in particular that the bound
$\|A_\alpha\|_{L^\infty} \le \varepsilon^{-1/27}$ holds for every $\alpha$ with $|\alpha| \ge 2$ (note that these terms are non-zero
only if $m \ge 2$) and that $\|A_\alpha\|_{\mathrm{Lip}} \le \varepsilon^{-1/9}$, since we assumed $m \ge 1$. At this point,
we fix $\kappa = 5/4$ and $\kappa' = 6/5$, so that in particular both exponents appearing in (7.7) are
greater than $1$. We then define $\mathrm{Wien}' = \mathrm{Wien} \cup \mathrm{Wien}_{\kappa'}^m$ so that, modulo $\mathrm{Wien}'$, (7.3a)
and (7.5) imply

$$\sup_{t \in [t_\ell, t_{\ell+1}]} \Bigl| \sum_{\alpha} \sum_{j=1}^{d} C_{\alpha,(j)}\, A_{\alpha \cup (j)}^\ell\, W_\alpha^\ell\, W_j^\ell(t) \Bigr| \le 2\varepsilon + \sup_{\ell \le \varepsilon^{-\kappa}} \sup_{t \in [t_\ell, t_{\ell+1}]} |E_\ell(t)| \le C\varepsilon . \tag{7.8}$$

The left hand side of this expression motivates the introduction of operators $M_j$ acting
on the set of families of stochastic processes by

$$(M_j A)_\alpha = C_{\alpha,(j)}\, A_{\alpha \cup (j)} .$$

Note that $M_j$ lowers the "degree" of $A$ by one in the sense that if $A_\alpha = 0$ for every
$|\alpha| > m$, then $(M_j A)_\alpha = 0$ for every $|\alpha| > m - 1$.
With this notation, we can rewrite (7.8) as

$$\sup_{t \in [t_\ell, t_{\ell+1}]} \Bigl| \sum_{j=1}^{d} Z_{M_j A}(t_\ell)\, W_j^\ell(t) \Bigr| \le C\varepsilon . \tag{7.9}$$

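Using the Brownian scaling and applying Lemma 7.4 combined with Lemma 6.10
shows the existence of a family $\mathrm{Osc}_W$ of negligible events such that (7.9) implies

$$|Z_{M_j A}(t_\ell)| \le \varepsilon^{7/20} , \qquad \forall \ell \le \varepsilon^{-5/4} .$$

Here, we used the fact that our choice of $\kappa$ implies that $1 - \kappa/2 > 7/20$. This shows
that the statements (7.3) imply

$$\|Z_{M_j A}\|_{L^\infty} \le \varepsilon^{7/20} + C_m \sup_{\alpha} \bigl( \varepsilon^{\kappa} \|A_\alpha\|_{\mathrm{Lip}} \|W_\alpha\|_{L^\infty} + \varepsilon^{2\kappa/5} \|A_\alpha\|_{L^\infty} \|W_\alpha\|_{C^{2/5}} \bigr)
\le \varepsilon^{7/20} + C_m \bigl( \varepsilon^{\kappa - 1/9 - 1/10} + \varepsilon^{1/2 - 1/9 - 1/30} \bigr) \le C_m \varepsilon^{7/20} , \tag{7.10}$$

modulo $\mathrm{Wien}' \cup \mathrm{Osc}_W$. Here, the constant $C_m > 1$ depends only on $m$.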

We have now finally arrived at the stage where we are able to apply our induction
hypothesis to each of the processes $Z_{M_j A}$. Note that since $7/20 > 1/3$, (7.3b) implies
that

$$\sup_{\alpha, j} \|(M_j A)_\alpha\|_{\mathrm{Lip}} \le (C_m \varepsilon^{7/20})^{-3^{-m}} ,$$

for all sufficiently small $\varepsilon$. Therefore, outside of the event $(\mathrm{Osc}_W^{m-1})_{C_m \varepsilon^{7/20}}$, one has
the implication

$$\Bigl\{ \sup_{j} \|Z_{M_j A}\|_{L^\infty} \le C_m \varepsilon^{7/20} \Bigr\} \;\&\; \Bigl\{ \sup_{\alpha} \|A_\alpha\|_{\mathrm{Lip}} \le \varepsilon^{-3^{-(m+1)}} \Bigr\}
\quad\Longrightarrow\quad \Bigl\{ \sup_{\alpha, j} \|(M_j A)_\alpha\|_{L^\infty} \le C'_m\, \varepsilon^{\frac{7}{20} 3^{-(m-1)}} \Bigr\} , \tag{7.11}$$

for some different constant $C'_m$ depending also only on $m$. Since $7/20 > 1/3$ and
since $\|(M_j A)_\alpha\|_{L^\infty} \ge \|A_{\alpha \cup (j)}\|_{L^\infty}$, this implies in particular that $\|A_\alpha\|_{L^\infty} \le \varepsilon^{3^{-m}}$
for every $\alpha \ne \emptyset$.

In order to conclude the proof of the theorem, it therefore only remains to obtain
a similar bound on $\|A_\emptyset\|_{L^\infty}$. We define a family of negligible events $\mathrm{Wien}''$, so that
$\mathrm{Wien}' \subset \mathrm{Wien}''$, and such that the bound

$$\sup_{t \in [0,1]} \sup_{|\alpha| \le m} |W_\alpha(t)| \le \varepsilon^{-\frac{1}{70} 3^{-(m-1)}} , \tag{7.12}$$

holds modulo $\mathrm{Wien}''$. We claim that if we define recursively

$$(\mathrm{Osc}_W^m)_\varepsilon = (\mathrm{Osc}_W^{m-1})_{C_m \varepsilon^{7/20}} \cup (\mathrm{Wien}'')_\varepsilon ,$$

the family $\mathrm{Osc}_W^m$ has the requested properties. It follows indeed from (7.3a), (7.12) and
the definition of $Z_A$ that, modulo $\mathrm{Osc}_W^m$, (7.3) imply the bound

$$\|A_\emptyset\|_{L^\infty} \le \varepsilon + \sum_{\alpha \ne \emptyset} \|A_\alpha\|_{L^\infty} \|W_\alpha\|_{L^\infty} \le \varepsilon + C \varepsilon^{(7/20 - 1/70)\, 3^{-(m-1)}} . \tag{7.13}$$

Since we chose the bound (7.12) in such a way that $7/20 - 1/70 > 1/3$, we obtain
$\|A_\emptyset\|_{L^\infty} \le \varepsilon^{3^{-m}}$ for sufficiently small $\varepsilon$. Together with the remark following (7.11),
this concludes the proof of Theorem 7.1. □
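
The proof relies on a handful of numerical inequalities between exponents. The sketch below collects them in exact rational arithmetic. It rests on assumptions that should be checked against the original article: the value $\kappa = 5/4$ is inferred from the range $\ell \le \varepsilon^{-5/4}$ and from the exponent $2\kappa/5 = 1/2$, while $\kappa' = 6/5$ is only one admissible choice in the interval that the constraints leave open.

```python
from fractions import Fraction as Fr

KAPPA = Fr(5, 4)        # assumption: inferred from the range of l and from 2*kappa/5 = 1/2
KAPPA_PRIME = Fr(6, 5)  # assumption: any value in (1 + 1/27 + 1/10, kappa) would do

CHECKS = {
    "kappa' < kappa": KAPPA_PRIME < KAPPA,
    "first exponent in (7.7) exceeds 1": KAPPA_PRIME - Fr(1, 27) - Fr(1, 10) > 1,
    "second exponent in (7.7) exceeds 1": KAPPA - Fr(1, 9) - Fr(1, 10) > 1,
    "1 - kappa/2 > 7/20": 1 - KAPPA / 2 > Fr(7, 20),
    "2*kappa/5 = 1/2": 2 * KAPPA / 5 == Fr(1, 2),
    "Hoelder exponent 2/5 < 1/2": Fr(2, 5) < Fr(1, 2),
    "1/2 - 1/9 - 1/30 > 7/20": Fr(1, 2) - Fr(1, 9) - Fr(1, 30) > Fr(7, 20),
    "7/20 > 1/3": Fr(7, 20) > Fr(1, 3),
    "7/20 - 1/70 > 1/3": Fr(7, 20) - Fr(1, 70) > Fr(1, 3),
}
ALL_OK = all(CHECKS.values())

for name, ok in CHECKS.items():
    print(f"{name}: {ok}")
```

Several of these margins are small (for instance $1/2 - 1/9 - 1/30 = 16/45$ against $7/20$), which is consistent with the remark that the exponents are somewhat arbitrary but cannot be moved freely.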

8 Examples 

In this section, we apply the abstract framework developed in this article to two more 
concrete examples: the stochastic Navier-Stokes equations on a sphere and a class of 
stochastic reaction-diffusion equations. The examples are chosen in order to highlight 
the techniques that can be used to verify the assumptions of our results and to get some 
idea of their scope of applicability. In particular, the Navier-Stokes equations provide
an example where bounds on the Jacobian are not very uniform, so that an initial
condition dependent control is required in Assumption C.1. The stochastic reaction-diffusion
system on the other hand satisfies very strong a priori bounds, but Assumption A.1 is
not verified with the usual choice $\mathcal{H} = L^2$, so that one has to work a bit more to fit the
equations into the framework presented here.

In both of our examples, the Hörmander-type assumption, Assumption C.2, will
be verified by using constant vector fields only. Before we turn to the examples
themselves, we therefore present the following useful little lemma that will be used
throughout this section:

Lemma 8.1 Let $\mathcal{H}$ be a separable Hilbert space and $\{g_i\}_{i=1}^{\infty} \subset \mathcal{H}$ a collection of
elements such that its span is dense in $\mathcal{H}$. Define a family of symmetric bilinear forms
$Q_n$ on $\mathcal{H}$ by $\langle h, Q_n h\rangle = \sum_{i=1}^{n} \langle g_i, h\rangle^2$. Let $\Pi \colon \mathcal{H} \to \mathcal{H}$ be any orthogonal projection
on a finite-dimensional subspace of $\mathcal{H}$. Then, there exists $N > 0$ and, for every $\alpha > 0$
there exists $c_\alpha > 0$ such that $\langle h, Q_n h\rangle \ge c_\alpha \|\Pi h\|^2$ for every $h \in \mathcal{H}$ with $\|\Pi h\| \ge
\alpha \|h\|$ and every $n \ge N$.

Proof. Assume by contradiction that the statement does not hold. Then, there exists
$\alpha > 0$ and a sequence $h_n$ in $\mathcal{H}$ such that $\|\Pi h_n\| = 1$, $\|h_n\| \le \alpha^{-1}$, and such that
$\lim_{n \to \infty} \langle h_n, Q_n h_n\rangle = 0$. Since $\|h_n\| \le \alpha^{-1}$ is bounded, we can assume (modulo
extracting a subsequence) that there exists $h \in \mathcal{H}$ such that $h_n \to h$ in the weak topology.
Since $\Pi$ has finite rank, one has $\|\Pi h\| = 1$. Furthermore, since the maps $h \mapsto \langle h, Q_n h\rangle$
are continuous in the weak topology and since $n \mapsto \langle h, Q_n h\rangle$ is increasing for every
$h$, one has

$$\langle h, Q_n h\rangle = \lim_{m \to \infty} \langle h_m, Q_n h_m\rangle \le \lim_{m \to \infty} \langle h_m, Q_m h_m\rangle = 0 ,$$

so that $\langle h, g_i\rangle = 0$ for every $i > 0$. This contradicts the fact that the span of the $g_i$ is
dense in $\mathcal{H}$. □
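
A finite-dimensional toy version of this lemma can be explored numerically. The sketch below is an illustration under assumptions that are not in the text: $\mathcal{H} = \mathbf{R}^3$, three hypothetical vectors `G` spanning it, $\Pi$ the projection onto the first coordinate, and $\alpha = 1/2$. It samples the cone $\{\|\Pi h\| \ge \alpha \|h\|\}$ and reports the smallest observed ratio $\langle h, Q_n h\rangle / \|\Pi h\|^2$ for each $n$; a sampled minimum is of course only an upper estimate of the true constant $c_\alpha$.

```python
import math
import random

# Hypothetical spanning family in R^3; g_1 alone does not see the range of Pi.
G = [(0.0, 1.0, 0.0), (1.0, 1.0, 0.0), (0.0, 0.0, 1.0)]

def quad_form(h, n):
    """<h, Q_n h> = sum over i <= n of <g_i, h>^2."""
    return sum(sum(gi * hi for gi, hi in zip(g, h)) ** 2 for g in G[:n])

def proj_norm_sq(h):
    """||Pi h||^2 for Pi the orthogonal projection onto the first coordinate."""
    return h[0] ** 2

def sample_cone(alpha, count, rng):
    """Rejection sampling of vectors h with ||Pi h|| >= alpha * ||h||."""
    out = []
    while len(out) < count:
        h = tuple(rng.gauss(0.0, 1.0) for _ in range(3))
        if abs(h[0]) >= alpha * math.sqrt(sum(x * x for x in h)):
            out.append(h)
    return out

rng = random.Random(1)
CONE = sample_cone(0.5, 2000, rng)
RATIOS = {n: min(quad_form(h, n) / proj_norm_sq(h) for h in CONE) for n in (1, 2, 3)}
print(RATIOS)
```

In this toy case $\langle h, Q_1 h\rangle$ vanishes at $h = (1, 0, 0)$, so no positive constant exists for $n = 1$, whereas for $n \ge 2$ an elementary minimisation over the second coordinate gives the ratio bound $1/2$. This mirrors the role of the threshold $N$ in the statement.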

8.1 The 2D Navier-Stokes equations on a sphere 

Consider the stochastically forced two-dimensional Navier-Stokes equations on the 
two-dimensional sphere S 2 : 

$$du = \nu \Delta u\, dt + \nu\, \mathrm{Ric}\, u\, dt - \nabla_u u\, dt - \nabla p\, dt + Q\, dW(t) , \qquad \operatorname{div} u = 0 . \tag{8.1}$$

Here, the velocity field $u$ is an element of $H^1(S^2, TS^2)$, $\nabla_u u$ denotes the covariant
differentiation of $u$ along itself with respect to the Levi-Civita connection on $S^2$,
$\Delta = -\nabla^* \nabla$ is the (negative of the) Bochner Laplacian on $S^2$, and Ric denotes the
Ricci operator from $TS^2$ into itself. In the case of the sphere, the latter is just the
multiplication with the scalar $1$. See also [Tay92, TW93, Nag97] for more details on the
Navier-Stokes equations on manifolds.

As in the flat case, it is possible to represent $u$ uniquely by a scalar "vorticity" field
$w$ given by

$$w = \operatorname{curl} u = -\operatorname{div}(n \wedge u) , \tag{8.2}$$

where $n$ denotes the unit vector in $\mathbf{R}^3$ normal to the surface of the sphere (so that $n \wedge u$
defines again a vector field on the sphere). With this notation, one can rewrite (8.1) as

$$dw = \nu \Delta w\, dt - \operatorname{div}(w\, Kw)\, dt + G\, dW(t) . \tag{8.3}$$

Here, we denoted by $K$ the operator that reconstructs a velocity field from its vorticity
field, that is

$$u = Kw = -\operatorname{curl} \Delta^{-1} w = n \wedge \nabla \Delta^{-1} w ,$$

and $\Delta$ denotes the Laplace-Beltrami operator on the sphere. See [TW93] for a more
detailed derivation of these equations. In order to fit the framework developed in this
article, we assume that the operator $G$ is of finite rank and that its image consists of
smooth functions, so that the noise term can be written as

$$G\, dW(t) = \sum_{i=1}^{n} g_i\, dW_i(t) , \qquad g_i \in H^{\infty}(S^2, \mathbf{R}) .$$

We choose to work in the space $\mathcal{H} = L^2(S^2, \mathbf{R})$ for the equation (8.3) in vorticity
formulation, so that the interpolation spaces $\mathcal{H}_\alpha$ coincide with the fractional Sobolev
spaces $H^{2\alpha}(S^2, \mathbf{R})$, see [Tri86]. In particular, elements $w \in \mathcal{H}_\alpha$ are characterised by
the fact that the functions $x \mapsto \varphi(x) w(\psi(x))$ belong to $H^{2\alpha}(\mathbf{R}^2)$ for any compactly
supported smooth function $\varphi$ and any function $\psi \colon \mathbf{R}^2 \to S^2$ which is smooth on an
open set containing the support of $\varphi$. Since the sphere is compact, this implies that the
usual Sobolev embeddings for the torus also hold true in this case.
Define now $A_0 = \{g_i : i = 1, \ldots, n\}$ and set recursively

$$A_{n+1} = A_n \cup \{\tilde B(v, w) : v, w \in A_n\} ,$$

where we made use of the symmetrised nonlinearity

$$\tilde B(v, w) = \tfrac{1}{2} \bigl( \operatorname{div}(w\, Kv) + \operatorname{div}(v\, Kw) \bigr) .$$

We then have the following result:

Theorem 8.2 If the closure of the linear span of $A_\infty = \bigcup_{n \ge 0} A_n$ is equal to all of
$L^2(S^2, \mathbf{R})$, then the equations (8.1) have a unique invariant measure.

Proof. The main step in the proof is to check that we can apply Theorem 5.4 to
conclude that the Markov semigroup generated by the solutions to (8.1) has the asymptotic
strong Feller property. Let us first check that the Navier-Stokes nonlinearity on the
sphere does indeed satisfy Assumption A.1 for some $a \in [0, 1)$. It is clear that the
nonlinearity $N$, defined by $N(w) = B(w, w)$, is continuous from $\mathcal{H}_\infty$ to $\mathcal{H}_\infty$ (which
coincides with the space of infinitely differentiable functions on the sphere), so in order
to show point 2, it remains to show that $N$ maps $\mathcal{H}_\gamma$ into $\mathcal{H}_{\gamma - a}$ for a range of values
$\gamma > 0$ and some $a \in [0, 1)$.

Setting $B(w, w') = \operatorname{div}(w\, Kw')$ so that $N(w) = B(w, w) = \tilde B(w, w)$, one can
show exactly as in [CF88] that, for any triplet $(s_1, s_2, s_3)$ with $s_i \ge 0$, $\sum_i s_i > 1$, one
has bounds of the type

$$\int_{S^2} v(x)\, B(w, w')(x)\, dx \le C \|v\|_{H^{s_1}} \|w\|_{H^{1 + s_2}} \|w'\|_{H^{s_3 - 1}} ,$$

$$\int_{S^2} v(x)\, B(w, w')(x)\, dx \le C \|v\|_{H^{1 + s_1}} \|w\|_{H^{s_2}} \|w'\|_{H^{s_3 - 1}} ,$$

for some constant C depending on the choice of the Sj. In particular, B can be in- 
terpreted as a continuous linear map from ri®ri into ri_ 3 (for example) and from 
Hi <8> Hi into H (using the usual identification of bilinear maps with linear maps 
between tensor products). It thus follows from the Calderon-Lions interpolation theo- 
rem as in Remark 1331 that B is a continuous linear map from H Q ® H a into H/3 for 
f3 = ^ - § and a £ [0, |]. For a > \, we use the fact that H Q is an algebra IITri92l 
to deduce that B is continuous from Ti a (g) H a into H a _ 1 . This shows that point 2 of 
Assumption lA.ll is satisfied with a = | (any exponent strictly larger than i would do, 
actually) and 7* = +00. 

Turning to point 3 of Assumption A.1, it suffices to show that, for $v$ sufficiently
smooth, the map $w \mapsto B(v, w)$ is bounded from $\mathcal{H}_{-\beta}$ into $\mathcal{H}_{-\beta - a}$. It is well-known
on the other hand that if $v \in C^k$ then the multiplication operator $w \mapsto vw$ is continuous
in $H^s$ for all $|s| \le k$. It follows immediately that $DN^*(v)$ is continuous from $\mathcal{H}_{-\beta}$
into $\mathcal{H}_{-\beta - 1/2}$, provided that $v \in C^k$ for $k > 2\beta$. Point 3 then follows with $\beta_\star = \infty$.

For any fixed $\eta > 0$, it follows from [HM06, Lemma 4.10] that Assumption C.1 is
verified with $\Psi_0(w) = \exp(\eta \|w\|^2)$. Furthermore, the density assumption that we made
on $A_\infty$ allows us to apply Lemma 8.1 so that Assumption C.2 holds. We conclude by
Theorem 6.7 that the bound (5.1) holds with $U(w) = \exp(\eta \|w\|^2)$, again for arbitrary
$\eta > 0$. If $\eta$ is chosen sufficiently small, then it follows again by [HM06, Lemma 4.10]
that $U$ is a Lyapunov function for the system, so that Assumption B.1 is satisfied.

Assumptions B.2 and B.3 can again be checked for $V(w) = \|w\|^2$ in exactly the
same way as [HM06, Lemma 4.10]. Finally, Assumption B.4 can be checked just as in
[HM06, Lemma 4.17], provided that we choose the projection $\Pi$ to be the orthogonal
projection onto the first $M$ eigenfunctions of the Laplace-Beltrami operator $\Delta$ for $M$
sufficiently large. This concludes the verification of the assumptions of Theorem 5.4
and the claim follows. □

Remark 8.3 Just as in [HM06], this result is optimal in the following sense. The
closure $\bar A_\infty$ of the linear span of $A_\infty$ in $L^2$ is always an invariant subspace for (8.3)
and the invariant measure for the Markov process restricted to $\bar A_\infty$ is unique. However,
if $\bar A_\infty \ne L^2$, then one expects in general the presence of more than one invariant
probability measure in $L^2$ at low values of the viscosity $\nu$.

8.2 Stochastic reaction-diffusion equations 

In this section, we consider a general class of reaction-diffusion equations on a "nice"
domain $D$. The dimension $m$ of the ambient space is chosen smaller or equal to $3$ for
technical reasons. However, the number $\ell$ of components in the reaction is arbitrary.
The domain $D$ is assumed to be one of the following:

• A compact smooth $m$-dimensional Riemannian manifold.

• A bounded open domain of $\mathbf{R}^m$ with smooth boundary.

• A hypercube in $\mathbf{R}^m$.

We furthermore denote by $\Delta$ the Laplace (resp. Laplace-Beltrami) operator on $D$,
endowed with either Neumann or Dirichlet boundary conditions. With these notations
in place, the equations that we consider are

$$du = \Delta u\, dt + f \circ u\, dt + \sum_{i=1}^{d} g_i\, dW_i(t) , \tag{8.4}$$

with $u(t) \colon D \to \mathbf{R}^\ell$ and $f \colon \mathbf{R}^\ell \to \mathbf{R}^\ell$ a polynomial of arbitrary degree $n$ with $n \ge 3$
an odd integer. (We exclude the case $n = 1$ since this gives rise to a linear equation
and is trivial to analyse.) The functions $g_i$ describing the stochastic component of the
equations are assumed to belong to $\mathcal{H}_\infty$, the intersection of the domains of $\Delta^\alpha$ in
$L^2(D)$ for all $\alpha > 0$. It is a straightforward exercise to check that (8.4) has unique
local solutions in $\mathcal{E} = C(D, \mathbf{R}^\ell)$ for every initial condition in $C(D, \mathbf{R}^\ell)$ (replace $C$ by
$C_0$ in the case of Dirichlet boundary conditions). In order to obtain global solutions,
we make the following assumption on the nonlinearity:

Assumption RD.1 Writing $f = \sum_{k=0}^{n} f_k$ for $f$ with $f_k$ being $k$-linear maps from $\mathbf{R}^\ell$
to itself, we assume that $n$ is odd and that

$$\langle f_n(u, \ldots, u, v), v\rangle < 0 ,$$

for every $u, v \in \mathbf{R}^\ell \setminus \{0\}$.

Remark 8.4 Provided that Assumption RD.1 holds, one can check that there exist
positive constants $c$ and $C$ such that the inequality

$$\langle f(u + v), u\rangle \le C (1 + \|v\|^{n+1}) - c \|u\|^{n+1} , \tag{8.5}$$

holds for every $u, v \in \mathbf{R}^\ell$.

Essentially, Assumption RD.1 makes sure that the function $u \mapsto |u|^2$ is a
Lyapunov function for the "reaction" part $\dot u = f(u)$ of (8.4). In the interest of brevity,
we define $\mathrm{Sup}_{t,\infty}(v) = 1 + \sup_{s \le t} \|v(s)\|_{\mathcal{E}}$ for any function $v \in L^\infty([0, t], \mathcal{E})$ and
$\mathrm{Sup}_{t,r}(v) = 1 + \sup_{s \le t} \|v(s)\|_{H^r}$ for $v \in L^\infty([0, t], H^r(D))$. As a consequence of
Assumption RD.1 we obtain the following a priori bound on the solutions to (8.4):

Proposition 8.5 Under Assumption RD.1 there exist constants $c$ and $C$ such that the
bound

holds almost surely for every $u_0 \in \mathcal{E}$, where $\mathcal{E}$ is either $C(D, \mathbf{R}^\ell)$ or $C_0(D, \mathbf{R}^\ell)$,
depending on the boundary conditions of $\Delta$. In particular, for every $t_0 > 0$ there exists a
constant $C$ such that one has the almost sure bound

$$\|u(t)\|_{L^\infty} \le C\, \mathrm{Sup}_{t,\infty}(W_\Delta) , \tag{8.6}$$

independently of the initial condition, provided that $t \ge t_0$.

Proof. The proof is straightforward and detailed calculations for a variant of it can be
found for example in [Hai08]. Setting $v = u - W_\Delta(t)$ where $W_\Delta$ is the "stochastic
convolution" solving the linearised equation (8.4) with $f = 0$, and defining $V(v) =
\|v\|_{L^\infty}^2$, we obtain from (8.5) the almost sure bound

$$\frac{d}{dt} V(v(t)) \le C\, \mathrm{Sup}_{t,\infty}^{n+1}(W_\Delta) - c\, V^{(n+1)/2}(v(t)) .$$

In particular, there exist possibly different constants such that

$$\frac{d}{dt} V(v(t)) \le -c\, V^{(n+1)/2}(v(t))$$

for all $v$ such that $V(v(t)) \ge C\, \mathrm{Sup}_{t,\infty}^{2}(W_\Delta)$. Since we assumed that $n \ge 3$, a simple
comparison theorem for ODEs then implies that 

where we set a = 2/n. The requested bound then follows at once. The second bound 
is an immediate consequence of the first one. □ 
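
The mechanism behind the comparison argument is that solutions of $\dot V = -c V^{(n+1)/2}$ with $n \ge 3$ forget their initial condition: they satisfy a bound that depends on $t$ but not on $V(0)$. The sketch below illustrates this for the model ODE only, under assumptions chosen for illustration ($n = 3$, $c = 1$, hypothetical function names); it compares the closed-form solution for several initial values with the universal bound $(c\,p\,t)^{-1/p}$, $p = (n-1)/2$, and cross-checks the closed form against a Runge-Kutta integration.

```python
def exact_solution(v0, t, n=3, c=1.0):
    """Solution of V' = -c V^((n+1)/2), V(0) = v0 > 0, for odd n >= 3."""
    p = (n - 1) / 2
    return (v0 ** (-p) + c * p * t) ** (-1.0 / p)

def universal_bound(t, n=3, c=1.0):
    """Upper bound on V(t) valid for every initial condition v0 > 0."""
    p = (n - 1) / 2
    return (c * p * t) ** (-1.0 / p)

def rk4(v0, t_end, steps, n=3, c=1.0):
    """Classical fourth-order Runge-Kutta integration of the same ODE."""
    def rhs(v):
        return -c * v ** ((n + 1) / 2)
    h = t_end / steps
    v = v0
    for _ in range(steps):
        k1 = rhs(v)
        k2 = rhs(v + 0.5 * h * k1)
        k3 = rhs(v + 0.5 * h * k2)
        k4 = rhs(v + h * k3)
        v += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return v

T = 0.5
VALUES = [exact_solution(v0, T) for v0 in (1.0, 1e3, 1e6, 1e12)]
print(VALUES, universal_bound(T))
```

For $n = 1$ the exponent $(n+1)/2$ equals $1$ and the decay is merely exponential, so no bound uniform in the initial condition is available, which is where the assumption $n \ge 3$ enters.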

Remark 8.6 The function $t \mapsto V(v(t))$ is of course not differentiable in time in
general. The left hand side in (8.6) should therefore be interpreted as the right upper Dini
derivative $\limsup_{h \to 0^+} h^{-1} \bigl( V(v(t + h)) - V(v(t)) \bigr)$.

In order to fit the framework developed in this article, we cannot take $L^2$ as our
base space, since the nonlinearity will not in general map $L^2$ into any Sobolev space
of negative order. However, provided that $k > m/2$, the Sobolev spaces $H^k$ form an
algebra, so that the nonlinearity $u \mapsto N(u) = f \circ u$ is continuous from $H^k$ to $H^k$ in
this case. It is therefore natural to choose $\mathcal{H} = H^k$ for some $k > m/2$. In this case,
for $\alpha > 0$, the interpolation spaces $\mathcal{H}_\alpha$ coincide with the Sobolev spaces $H^{k + 2\alpha}$, so
that one has $N \in \mathrm{Poly}(\mathcal{H}_\alpha, \mathcal{H}_\alpha)$ for every $\alpha > 0$. This shows that Assumption A.1
is satisfied with $a = 0$, $\gamma_\star = \infty$ and $\beta_\star = \infty$. It turns out that it is relatively easy to
obtain bounds in the Sobolev space $H^2$. From now on, we do therefore assume that the
following holds:

Assumption RD.2 The space dimension $m$ is smaller or equal to $3$.

This will allow us to work in $\mathcal{H} = H^2$. Before we state the main theorem of this
section, we obtain a number of a priori bounds that will allow us to verify that the
assumptions from the previous parts of this article do indeed apply to the problem at
hand.

By using a bootstrapping argument similar to Proposition 3.5, we can obtain the
following a priori estimate:

Proposition 8.7 Assume that Assumptions RD.1 and RD.2 hold. If $u$ is the solution to
(8.4) with initial condition $u_0 \in H^2$ then there exists a constant $C$ such that the bounds

\\u(t)\\ H * < CSu V 2 %(u)(\\u \\ H 2 + Swp t>2 (W A )) , 
\\u(t)\\ H2 < CSu V 2 %(u)(±\\u \\ L 2 + Sup ti2 (W A )) , 

hold for all $t \le 1$ almost surely.

Proof. From Duhamel's formula, we obtain the bound

Ht)\\ H i < \\u \\ H i + f -7=2=11/ o u(s)\\ L 2 ds + Sup t ^Wa) 
Jo V'-s 

< ||u || H 2 + CVtSupl^iu) + Sup M (W A ) • 

At this stage, we use that since $f$ is a polynomial of degree $n$, there exists a constant $C$
such that

11/ o uii^i < c(i + IMI^ + imi^imm . (8.7) 

Using Duhamel's formula again, this yields 

\\u(t)\\ H 2 <\\u Q \\ H 2 + f -2=\\fou(s)\\ H ids + Sup t2 (WA) 

Jo Vt - s 

< \\u \\ H 2 + [ -£={1 + \\u{s)\\l x + \\u( S )\\lz}\\u(s)\\ H i) ds 

J a Vt- s 

+ Sup ti2 (W A ) 

< \\u Q \\ H 2+J^ -^(sup^C^ + Sup^C^GIuolliP 

+ V^Supl^iu) + Sup til (W A ))) ds + Sup t2 (W A ) ■ 

Integrating the last term yields the first bound. The second bound can be obtained in 
exactly the same way, using the smoothing properties of the semigroup generated by 
the Laplacian. □ 

As a consequence, we obtain the following bound on the exponential moments in 
H 2 of the solution starting from any initial condition: 

Proposition 8.8 For every $T > 0$, there exists a constant $C > 0$ such that

Erap(||u(T)||^ n )<<7, 
for every initial condition $u_0 \in H^2$.

Proof. Without loss of generality, we set $T = 1$. Combining Proposition 8.7 and the
Markov property, we see that there exists a constant $C > 0$ such that

IKi)IU2<c( sup ||«(s)||2-o)(||«(|)IU» + Sup 1)2 (Wa)) 

|< s <i 

<c( sup ||«(s)||2-)Siip 1j2 (Wa) ■ 

The requested bound then follows from ( 18.6b and the fact that Sup x 2 (Wa) has Gaus- 
sian tails by Fernique's theorem. □ 

We now turn to bounds on the Jacobian $J$ for (8.4). Recall from (3.10) that, given 
any "tangent vector" $\xi$, the Jacobian $J_{s,t}\xi$ satisfies the random PDE 

$$\partial_t J_{s,t}\xi = \Delta J_{s,t}\xi + (Df\circ u)(t)\,J_{s,t}\xi,$$

where $Df$ denotes the derivative of the map $f$. Our main tool is the fact that, from 
Assumption RD.1, we obtain the existence of a constant $C > 0$ such that 

$$\langle Df(u)v, v\rangle \le C|v|^2,$$

for every $u, v \in \mathbf{R}^\ell$. In particular, we obtain the a priori $L^2$ estimate 

$$\partial_t\|J_{s,t}\xi\|^2_{L^2} = -2\|\nabla J_{s,t}\xi\|^2_{L^2} + 2\langle J_{s,t}\xi, (Df\circ u)(t)J_{s,t}\xi\rangle \le 2C\|J_{s,t}\xi\|^2_{L^2}, \qquad (8.8)$$

so that $\|J_{s,t}\xi\|_{L^2} \le e^{C(t-s)}\|\xi\|_{L^2}$ almost surely. We now use similar reasoning to obtain 
a sequence of similar estimates in smoother spaces. 

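Proposition 8.9 For any $u_0 \in H^2$, the Jacobian satisfies the operator bounds 

$$\begin{aligned}
\|J_{s,t}\|_{L^2\to L^2} &\le C, \\
\|J_{s,t}\|_{L^2\to H^1} &\le C\Bigl(\frac{1}{\sqrt{t-s}} + \mathrm{Sup}^{n-1}_{t,\infty}(u)\Bigr), \\
\|J_{s,t}\|_{H^1\to H^1} &\le C\,\mathrm{Sup}^{n-1}_{t,\infty}(u), \\
\|J_{s,t}\|_{H^2\to H^2} &\le C\,\mathrm{Sup}^{4n}_{t,\infty}(u)\bigl(\|u_0\|_{H^2} + \mathrm{Sup}_{t,2}(W_\Delta)\bigr), \\
\|J_{s,t}\|_{H^1\to H^2} &\le \frac{C}{\sqrt{t-s}}\,\mathrm{Sup}^{4n}_{t,\infty}(u)\bigl(\|u_0\|_{H^2} + \mathrm{Sup}_{t,2}(W_\Delta)\bigr),
\end{aligned}$$

for $0 \le s < t \le 1$, with $\mathrm{Sup}_{t,\infty}$ defined just before Proposition 8.7. 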



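Proof. The first estimate is just a rewriting of the calculation before the Proposition. As 
in the proof of the a priori bounds for the solution, we are going to use a bootstrapping 
argument, starting from the bound (8.8). Applying Duhamel's formula and using the 
notation $\mathrm{Sup}_{t,\infty}$ as before, we obtain 

$$\begin{aligned}
\|J_{s,t}\xi\|_{H^1} &\le \|\xi\|_{H^1} + \int_s^t \frac{C}{\sqrt{t-r}}\,\|(Df\circ u)(r)J_{s,r}\xi\|_{L^2}\,dr \\
&\le \|\xi\|_{H^1}\Bigl(1 + C\,\mathrm{Sup}^{n-1}_{t,\infty}(u)\int_s^t \frac{dr}{\sqrt{t-r}}\Bigr) \\
&\le C\,\|\xi\|_{H^1}\,\mathrm{Sup}^{n-1}_{t,\infty}(u).
\end{aligned}$$

(And similarly for the second bound.) Regarding the $H^2$ norm of the Jacobian, we use 
the fact that there is a constant $C$ such that the bound 

$$\begin{aligned}
\|Df(u)v\|_{H^1} &\le C\bigl(\|u\|^{n-1}_{L^\infty}\|v\|_{H^1} + \|u\|^{n-2}_{L^\infty}\|\nabla u\|_{L^4}\|v\|_{L^4}\bigr) \\
&\le C\,\|u\|^{n-2}_{L^\infty}\|u\|_{H^2}\|v\|_{H^1}
\end{aligned}$$

holds. Hence we get, similarly to before, 

$$\begin{aligned}
\|J_{s,t}\xi\|_{H^2} &\le \|\xi\|_{H^2} + \int_s^t \frac{C}{\sqrt{t-r}}\,\|(Df\circ u)(r)J_{s,r}\xi\|_{H^1}\,dr \\
&\le \|\xi\|_{H^2} + \int_s^t \frac{C}{\sqrt{t-r}}\,\|u(r)\|^{n-2}_{L^\infty}\|u(r)\|_{H^2}\|J_{s,r}\xi\|_{H^1}\,dr \\
&\le C\,\mathrm{Sup}^{4n}_{t,\infty}(u)\bigl(\|u_0\|_{H^2} + \mathrm{Sup}_{t,2}(W_\Delta)\bigr)\|\xi\|_{H^2},
\end{aligned}$$

which is the requested bound. To obtain the last bound, one proceeds identically except 
that one uses $\|e^{\Delta(t-s)}\xi\|_{H^2} \le C\|\xi\|_{H^1}/\sqrt{t-s}$. □ 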

We now turn to the second variation. 

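Proposition 8.10 For any $u_0 \in H^2$, the second variation $J^{(2)}$ of the solution to (8.4) 
satisfies 

$$\|J^{(2)}_{s,t}\|_{H^2\otimes H^2\to H^2} \le C\,\mathrm{Sup}^{17n}_{t,\infty}(u)\bigl(\|u_0\|_{H^2} + \mathrm{Sup}_{t,2}(W_\Delta)\bigr)^5$$
for $0 \le s < t \le 1$, with $\mathrm{Sup}_{t,\infty}$ defined just before Proposition 8.7. 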
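Proof. Again using Duhamel's formula, we have 

$$J^{(2)}_{s,t}(\varphi,\psi) = \int_s^t J_{r,t}\,D^2F(u_r)\bigl(J_{s,r}\varphi, J_{s,r}\psi\bigr)\,dr. \qquad (8.9)$$

To control the $H^2$ norm we will need the following estimate: 

$$\begin{aligned}
\|\nabla^2 D^2F(u)(\varphi,\psi)\|_{L^2} &\le C\bigl(1 + \|u\|^{n-2}_{L^\infty}\bigr)\Bigl(\|(\nabla^2 u)\varphi\psi\|_{L^2} + \|(\nabla u)^2\varphi\psi\|_{L^2} + \|(\nabla^2\varphi)\psi\|_{L^2} + \|\varphi(\nabla^2\psi)\|_{L^2} \\
&\qquad + \|(\nabla\varphi)(\nabla\psi)\|_{L^2} + \|(\nabla u)(\nabla\psi)\varphi\|_{L^2} + \|(\nabla u)(\nabla\varphi)\psi\|_{L^2}\Bigr) \\
&\le C\bigl(1 + \|u\|^{n-2}_{L^\infty}\bigr)\Bigl(\|u\|_{H^2}\|\varphi\|_{L^\infty}\|\psi\|_{L^\infty} + \|u\|^2_{H^2}\|\varphi\|_{L^\infty}\|\psi\|_{L^\infty} + \|\varphi\|_{H^2}\|\psi\|_{L^\infty} + \|\varphi\|_{L^\infty}\|\psi\|_{H^2} \\
&\qquad + \|\varphi\|_{H^2}\|\psi\|_{H^2} + \|u\|_{H^2}\|\psi\|_{L^\infty}\|\varphi\|_{H^2} + \|u\|_{H^2}\|\varphi\|_{L^\infty}\|\psi\|_{H^2}\Bigr) \\
&\le C\bigl(1 + \|u\|^{n-2}_{L^\infty}\bigr)\bigl(1 + \|u\|^2_{H^2}\bigr)\|\varphi\|_{H^2}\|\psi\|_{H^2}.
\end{aligned}$$

In this estimate, we have used repeatedly the fact that $\|v\|_{L^4} \le C\|v\|_{H^1}$ and $\|v\|_{L^\infty} \le C\|v\|_{H^2}$. 
Using this estimate in (8.9), we obtain 

$$\begin{aligned}
\|J^{(2)}_{s,t}(\varphi,\psi)\|_{H^2} &\le C\,\mathrm{Sup}^{n-2}_{t,\infty}(u)\int_s^t \bigl(1 + \|u_r\|^2_{H^2}\bigr)\|J_{r,t}\|_{H^2\to H^2}\|J_{s,r}\varphi\|_{H^2}\|J_{s,r}\psi\|_{H^2}\,dr \\
&\le C\,\mathrm{Sup}^{17n}_{t,\infty}(u)\bigl(\|u_0\|_{H^2} + \mathrm{Sup}_{t,2}(W_\Delta)\bigr)^5\|\varphi\|_{H^2}\|\psi\|_{H^2},
\end{aligned}$$
which completes the proof. □ 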
We now set the stage to prove the analogue of Theorem 5.4 for equation (8.4). We 
begin by collecting a number of relevant results implied by the preceding calculations. 
We will work in $H^2$ since this will be the base space for what follows. 

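Proposition 8.11 Define $V(u) = \|u\|^{1/n}_{H^2}$. Then, for every $p > 0$ there exists a constant 
$C_p$ and, for every $\eta > 0$ and $p > 0$, there exists a constant $C_{\eta,p}$ so that the bounds 

$$\begin{aligned}
\mathbf{E}\sup_{1/2 \le t \le 1}\|u(t)\|^p_{H^2} &\le C_p, \\
\mathbf{E}\sup_{1/2 \le s < t \le 1}\|J_{s,t}\|^p_{H^2\to H^2} &\le C_p, \\
\sup_{0 \le s < t \le 1}\mathbf{E}\|J_{s,t}\|^p_{H^2\to H^2} &\le \exp\bigl(\eta p V(u_0) + pC_{\eta,p}\bigr), \\
\sup_{0 \le s < t \le 1}\mathbf{E}\|J^{(2)}_{s,t}\|^p_{H^2\otimes H^2\to H^2} &\le \exp\bigl(\eta p V(u_0) + pC_{\eta,p}\bigr),
\end{aligned}$$

hold for all $u_0 \in H^2$. 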

Proof. The first two bounds are a consequence of Propositions 8.8, 8.9 and 8.5. 
In order to get the second two bounds, note that 

$$\sup_{0 \le s < t \le 1}\mathbf{E}\|J^{(2)}_{s,t}\|^p_{H^2\otimes H^2\to H^2} \le C_p\bigl(1 + \|u_0\|_{H^2}\bigr)^{(17n+5)p} = \exp\bigl((17n+5)p\log(1 + \|u_0\|_{H^2}) + \log C_p\bigr)$$

as a consequence of Propositions 8.10, 8.8 and 8.5. A similar bound holds for $J$. 
Since, for any positive $q, r, \eta, K$ there exists a $C_{q,r,\eta,K}$ so that $q\log(1+x) + \log(K) \le \eta x^r + C_{q,r,\eta,K}$ 
for all $x > 0$, the quoted bound holds. □ 
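
The elementary inequality used at the end of this proof can be checked numerically. The following Python sketch is an illustration only and not part of the original argument; the function name `log_bound_constant` and the sample parameters are hypothetical. It computes the smallest constant $C$ with $q\log(1+x) + \log K \le \eta x^r + C$ for all $x \ge 0$ by locating the maximiser of the difference of the two sides.

```python
import math

def log_bound_constant(q, r, eta, K=1.0):
    """Smallest C with q*log(1+x) + log(K) <= eta*x**r + C for all x >= 0.

    Assumes q, r, eta, K > 0. The difference h(x) of the two sides has
    h'(x) > 0 exactly when g(x) < q, with g defined below.
    """
    def h(x):
        return q * math.log1p(x) + math.log(K) - eta * x**r

    def g(x):
        return eta * r * (x**(r - 1.0) + x**r)

    # g attains its minimum over [0, inf) at lo and increases afterwards.
    lo = (1.0 - r) / r if r < 1.0 else 0.0
    if g(lo) >= q:
        return h(0.0)              # h never increases, so the sup is at x = 0
    hi = max(1.0, 2.0 * lo)
    while g(hi) < q:               # bracket the last sign change of h'
        hi *= 2.0
    for _ in range(200):           # bisection for the root of g(x) = q
        mid = 0.5 * (lo + hi)
        if g(mid) < q:
            lo = mid
        else:
            hi = mid
    return max(h(0.0), h(lo))

# Example with q = 4n + 1 for n = 3, as in the moment bounds of this section.
C = log_bound_constant(q=13.0, r=0.5, eta=0.1)
```

The constant grows as $\eta$ or $r$ decreases, which is why the constants in the propositions above depend on these parameters.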

We assume from now on that the $g_j$ used in the definition of the forcing all belong 
to $H^4$. We now construct a particular subset of the $A_n$ defined in Section 6.2 using only 
the highest degree nonlinear term. By doing so we obtain only constant vector fields, 
thus trivializing Assumption C.2 in light of Lemma 8.1. Setting $A_1 = \{g_1, \ldots, g_d\}$, we 
define recursively $A_{k+1} = A_k \cup \{F_n(h_1, \ldots, h_n) : h_j \in A_k\}$ and $A_\infty = \bigcup_k A_k$. Notice 
that since $g_j \in H^4$, all of the $A_n$ are contained in $H^4$, because $H^4$ is a multiplicative 
algebra in our setting. 

Proposition 8.12 If $\mathrm{span}(A_\infty)$ is dense in $H^2$ then, given any $H^2$-orthogonal projection 
$\Pi$ onto a finite dimensional subspace, there exists $\delta > 0$ such that Assumption B.1 
holds with U(u) = 

Proof. Proposition 8.11 guarantees that all of the assumptions of Theorem 6.7 hold 
except Assumption C.2. However, since by construction all of the vector fields in $A_n$ 
are constant, Assumption C.2 clearly holds with $\Lambda_\alpha$ a constant if $\Pi$ is an orthogonal 
projection onto a subspace of $\mathrm{span}(A_n)$. Lemma 8.1 furthermore shows that it actually 
holds for any finite rank orthogonal projection. □ 

Let $\Pi_M$ be the projection on the eigenfunctions of the Laplacian with eigenvalues 
smaller than $M^2$. We will now restrict ourselves to such a projection since it allows 
for easy verification of the pathwise smoothing/contracting properties needed for 
Assumption B.4. We have indeed the following bound: 
Proposition 8.13 Given any positive $\eta$, $r$ and $p$, there exists a $C_{\eta,r,p}$ so that the bound 

$$\mathbf{E}\|J_{0,1}\Pi^\perp_M\|^p_{H^2\to H^2} \le \exp\bigl(p\eta\|u_0\|^r_{H^2} - p\log(M) + pC_{\eta,r,p}\bigr)$$
holds for all $u_0 \in H^2$, and all $M \in \mathbf{N}$. 
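Proof. First observe that 

$$\begin{aligned}
\|J_{0,1}\Pi^\perp_M\|_{H^2\to H^2} &\le \|J_{0,1}\|_{H^1\to H^2}\|\Pi^\perp_M\|_{H^2\to H^1} \le M^{-1}\|J_{0,1}\|_{H^1\to H^2} \\
&\le CM^{-1}\,\mathrm{Sup}^{4n}_{1,\infty}(u)\bigl(\|u_0\|_{H^2} + \mathrm{Sup}_{1,2}(W_\Delta)\bigr) \\
&\le CM^{-1}\bigl(\mathrm{Sup}_{1,2}(W_\Delta) + \|u_0\|_{H^2} + 1\bigr)^{4n+1}.
\end{aligned}$$

Raising both sides to the power $p$, taking expectations, and using the fact that the law 
of $\mathrm{Sup}_{1,2}(W_\Delta)$ has Gaussian tails, we obtain 

$$\mathbf{E}\|J_{0,1}\Pi^\perp_M\|^p_{H^2\to H^2} \le \exp\bigl(p(4n+1)\log(1 + \|u_0\|_{H^2}) - p\log(M) + pC_p\bigr).$$

The claim now follows from the fact that, for any $\eta > 0$ and $r > 0$, there exists a $C_{\eta,r}$ 
with $(4n+1)\log(1+x) \le \eta x^r + C_{\eta,r}$ for all $x > 0$. □ 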

Theorem 8.14 Let $\mathcal{P}_t$ be the Markov semigroup on $H^2$ generated by (8.4). If the 
linear span of $A_\infty$ is dense in $H^2$ then, for every orthogonal finite rank projection 
$\Pi : H^2 \to H^2$, for every $p > 0$, and for every $\alpha > 0$, there exists a constant $C(\alpha, p, \Pi)$ 
such that the bound (5.7) on the Malliavin matrix holds with $U = 1$. 

Proof. The result follows from Theorem 6.7. One can check that Assumption A.1 
holds with $\mathcal{H} = H^2$, $a = 0$, $\gamma_\star = \beta_\star = \infty$, since $H^\ell$ is a multiplicative algebra for 
every $\ell \ge 2$ (this is true because we restricted ourselves to dimension $m \le 3$). Since 
the most involved part is the assumption on the adjoint, part 3, we give the details for 
that one. One can verify that the adjoint of $DN(u)$ in $\mathcal{H}$ acts on elements $v$ in $\mathcal{H}^\infty$ as 

$$DN^*(u)v = \Delta^{-2} f'(u)\,\Delta^2 v.$$

(This is because $\mathcal{H}$ is the Sobolev space $H^2$ and not the space $L^2$.) The claim then 
follows from the fact that the multiplication by a smooth enough function is a bounded 
operator in every Sobolev space $H^\ell$ with $\ell \in \mathbf{R}$. 

Since Assumption C.2 (with $\Lambda_\alpha$ a constant depending on $\Pi$) can be verified by 
using Lemma 8.1, it remains to verify Assumption C.1 with $\Psi_0 = 1$. This in turn is an 
immediate consequence of Proposition 8.11. □ 

Combining all of these results, we finally obtain the following result on the 
asymptotic strong Feller property of a general reaction-diffusion equation: 

Theorem 8.15 Let $\mathcal{P}_t$ be the Markov semigroup on $H^2$ generated by (8.4) and let 
Assumptions RD.1 and RD.2 hold. If the linear span of $A_\infty$ is dense in $H^2$ then, for any 
$\zeta > 0$, there exists a positive constant $C$ so that for every $u \in H^2$ and $\varphi : H^2 \to \mathbf{R}$ 
one has 

$$\|D\mathcal{P}_t\varphi(u)\|_{L^2\to\mathbf{R}} \le C\Bigl(\|\varphi\|_{L^\infty} + e^{-\zeta t}\sup_{v \in H^2}\|D\varphi(v)\|_{H^2\to\mathbf{R}}\Bigr). \qquad (8.10)$$

In particular, $\mathcal{P}_t$ has the asymptotic strong Feller property in $H^2$. 
Remark 8.16 As a corollary, we see that for the semigroup on $\mathcal{E}$, one has 

$$\|D\mathcal{P}_t\varphi(u)\| \le C\bigl(\|\varphi\|_{L^\infty} + e^{-\zeta t}\|D\varphi\|_{L^\infty}\bigr),$$

where all the derivatives are Fréchet derivatives of functions from $\mathcal{E}$ to $\mathbf{R}$. 

In particular, in space dimension $m = 1$, the same bound is obtained in the space 
$H^1$ since one then has $H^1 \subset \mathcal{E}$. 

Proof. The result follows from Theorem 5.4. Fix $\Pi = \Pi_M$, the projection onto the 
eigenfunctions of $\Delta$ with eigenvalues of modulus less than $M^2$. The constant $M$ is 
going to be determined later on. Assumption B.2 with $V(u) = \|u\|^{1/n}_{H^2}$ and $\eta' = 0$ 
follows immediately from Proposition 8.8. Fix any $p > 10$ and any positive $\eta < 1/p$. 
Assumption B.3 then follows from Proposition 8.11. It then follows from 
Proposition 8.13 that we can choose the value of $M$ in the definition of $\Pi$ sufficiently large so 
that Assumption B.4 holds and such that $(C_\Pi - C_J)/2 - \eta C_L > \zeta$. Since, in view of 
Theorem 8.14, Assumption B.1 holds with $U = 1$, we thus obtain from Theorem 5.4 
the bound 

$$\|D\mathcal{P}_t\varphi(u)\|_{H^2\to\mathbf{R}} \le Ce^{2\eta\|u\|^{1/n}_{H^2}}\Bigl(\|\varphi\|_{L^\infty} + e^{-\zeta t}\sup_{v \in H^2}\|D\varphi(v)\|_{H^2\to\mathbf{R}}\Bigr). \qquad (8.11)$$

In order to obtain (8.10), we note that one has 

$$\mathbf{E}\|J_{0,2}\|_{L^2\to H^2} \le \mathbf{E}\bigl(\|J_{0,1}\|_{L^2\to L^2}\|J_{1,2}\|_{L^2\to H^2}\bigr) \le C\,\mathbf{E}\|J_{1,2}\|_{L^2\to H^2} \le C, \qquad (8.12)$$

where $C$ is a universal constant independent of the initial condition. Here, we combined 
the bounds of Proposition 8.9 with Proposition 8.5 in order to obtain the last bound. 
We thus have 

$$\begin{aligned}
\|D\mathcal{P}_t\varphi(u)\|_{L^2\to\mathbf{R}} &= \|D\mathcal{P}_2\mathcal{P}_{t-2}\varphi(u)\|_{L^2\to\mathbf{R}} \\
&\le \mathbf{E}\bigl(\|D\mathcal{P}_{t-2}\varphi(u_2)\|_{H^2\to\mathbf{R}}\|J_{0,2}\|_{L^2\to H^2}\bigr) \\
&\le C\Bigl(\|\varphi\|_{L^\infty} + e^{-\zeta t}\sup_{v}\|D\varphi(v)\|_{H^2\to\mathbf{R}}\Bigr)\mathbf{E}\bigl(e^{2\eta\|u_2\|^{1/n}_{H^2}}\|J_{0,2}\|_{L^2\to H^2}\bigr),
\end{aligned}$$

where we made use of (8.11) to obtain the last inequality. The requested bound now 
follows from (8.12) and Proposition 8.8. □ 

8.3 Unique ergodicity of the stochastic Ginzburg-Landau equation 

In this section, we show under very weak conditions on the driving noise that the 
stochastic real Ginzburg-Landau equation has a unique invariant measure. Recall that 
this equation is given by 

$$du(x,t) = \nu\,\partial_x^2 u(x,t)\,dt + \eta\,u(x,t)\,dt - u^3(x,t)\,dt + \sum_{j=1}^{d} g_j(x)\,dW_j(t), \qquad (8.13)$$

where the spatial variable $x$ takes values on the circle, $x \in S^1$, and the driving 
functions $g_j$ belong to $C^\infty(S^1, \mathbf{R})$. The two positive parameters $\nu$ and $\eta$ are assumed to be 
fixed throughout this section. This is a particularly simple case of the type of equation 
considered above, so that Theorem 8.15 applies. The aim of this section is to show 
one possible technique for obtaining the uniqueness of the invariant measure for such 
a parabolic SPDE. It relies on Corollary 2.2 and yields: 
Theorem 8.17 Consider (8.13) and suppose that 

1. there exists a linear combination $g$ of the $g_j$ that has only finitely many simple 
zeroes, 

2. the smallest vector space containing all the $g_j$ and closed under the operation 
$(f, g, h) \mapsto fgh$ is dense in $H^1(S^1)$. 

Then (8.13) has exactly one invariant probability measure. 
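
Condition 2 can be explored on toy examples. The sketch below is a simplified illustration under assumptions that are not in the text: the driving functions are taken to be pure Fourier modes $e^{ikx}$ (so the vector space is understood over the complex numbers), only modes with $|k|$ up to a cutoff are tracked, and the name `reachable_modes` is hypothetical. Since $e^{iax}e^{ibx}e^{icx} = e^{i(a+b+c)x}$, closing under triple products amounts to closing the set of frequencies under triple sums.

```python
def reachable_modes(generators, max_mode, max_rounds=50):
    """Frequencies k (|k| <= max_mode) reachable from the generator
    frequencies by repeatedly forming triple products of Fourier modes.

    Toy model only: each generator is the pure mode exp(i*k*x), and
    frequencies beyond the cutoff max_mode are discarded.
    """
    modes = {k for k in generators if abs(k) <= max_mode}
    for _ in range(max_rounds):
        new = {a + b + c
               for a in modes for b in modes for c in modes
               if abs(a + b + c) <= max_mode}
        if new <= modes:          # nothing new: the set is closed
            break
        modes |= new
    return modes

with_constant = reachable_modes({-1, 0, 1}, max_mode=5)
without_constant = reachable_modes({-1, 1}, max_mode=5)
```

With generator frequencies $\{-1, 0, 1\}$ every mode up to the cutoff is reached, whereas $\{-1, 1\}$ (that is, $\sin x$ and $\cos x$ alone) only produces odd frequencies, so in that toy case the generated space cannot be dense.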

Remark 8.18 Actually, we could have relaxed the regularity assumption on the $g_j$'s. 
If we choose $\mathcal{H} = H^1$, $\gamma_\star = 2\varepsilon - 1$, $a = 1 - \varepsilon$, and $\beta_\star = 1$, we can check that 
Assumption A.1 is satisfied as soon as $g_j \in H^{1+4\varepsilon}$. Furthermore, in this case, all 
the relevant Lie brackets for assumption 2 in Theorem 8.17 are admissible, so that its 
conclusion still holds. 

Looking at Corollary 2.2, the two main ingredients needed to prove Theorem 8.17 
are the establishment of the estimate in (1.2) and the needed form of irreducibility. The 
first will follow almost instantly from the second assumption of Theorem 8.17, which 
ensures that $\mathrm{span}(A_\infty)$ is dense in $H^1$. The irreducibility is given by the following 
proposition, whose proof is postponed to the end of this section. 

Proposition 8.19 Consider (8.13) under the first condition in Theorem 8.17. Then 
there exists a positive $K$ so that for any $\varepsilon > 0$ there is a $v$ with $\|v\|_{H^1} \le K$ and a 
$T > 0$ so that $\mathcal{P}_T(u_0, B_\varepsilon(v)) > 0$ for all $u_0 \in H^1$. Here $B_\varepsilon(v)$ is the $\varepsilon$ ball in the 
$H^1$-norm. 

Proof of Theorem 8.17. The existence of an invariant probability measure for (8.13) 
is standard, see for example [Cer99]. Furthermore, since we are working in space 
dimension 1, $H^1$ is already a multiplicative algebra and one can retrace the proof of 
Theorem 8.15 for $\mathcal{H} = H^1$. This shows that assumption 2 implies that the semigroup 
generated by (8.13) satisfies (1.2) on the Hilbert space $\mathcal{H} = H^1(S^1)$. It therefore 
remains to show that assumption 1 implies the assumption of Corollary 2.2, which is the content of Proposition 8.19. □ 

In fact we have established much more than just uniqueness of the invariant 
measure. We now use the results from [HM08] to establish a spectral gap. For any Fréchet 
differentiable function $\varphi : H^1 \to \mathbf{R}$, define the norm $\|\varphi\|_{\mathrm{Lip}} = \sup_u\bigl(|\varphi(u)| + \|D\varphi(u)\|_{H^1\to\mathbf{R}}\bigr)$. 
In turn we define a metric on probability measures $\mu, \nu$ on $H^1$ by 
$d(\mu, \nu) = \sup\{\int\varphi\,d\mu - \int\varphi\,d\nu : \|\varphi\|_{\mathrm{Lip}} \le 1\}$. Combining (8.10), Proposition 8.19 and 
[HM08, Theorem 2.5] yields the following corollary to Theorem 8.17. 

Corollary 8.20 Under the assumptions of Theorem 8.17, there exist positive constants 
$C$ and $\gamma$ so that $d(\mathcal{P}_t^*\mu, \mathcal{P}_t^*\nu) \le Ce^{-\gamma t}d(\mu, \nu)$ for any two probability measures $\mu$ and 
$\nu$ on $H^1$ and $t \ge 1$. 

Proof of Proposition 8.19. Fix an arbitrary initial condition $u_0$ and some $\varepsilon > 0$. Our 
aim is to find a target $v$, bounded controls $V_j(t)$, and a terminal time $T > 0$, so that the 
solution to the controlled problem 

$$\partial_t u(x,t) = \nu\,\partial_x^2 u(x,t) + \eta\,u(x,t) - u^3(x,t) + f(x,t), \qquad f(x,t) = \sum_{j=1}^{d} g_j(x)V_j(t), \qquad (8.14)$$

satisfies $\|u(T) - v\|_{H^1} \le \varepsilon$. Furthermore, we want to be able to choose $v$ such that 
$\|v\|_{H^1} \le K$ for some constant $K$ independent of $\varepsilon$. The claim on the topological 
supports of transition probabilities then follows immediately from the fact that the Itô 
map $(u_0, W) \mapsto u_t$ is continuous in the second argument in our case. 
The idea is to choose $f$ of the form 

$$f(x,t) = \begin{cases} \varepsilon^{-\gamma} g(x) & \text{for } 1 \le t \le 2, \\ 0 & \text{otherwise,} \end{cases}$$

and to set $T = 3$. We furthermore set $v$ to be the solution at time 1 for the uncontrolled 
equation (that is (8.14) with $f = 0$) with an initial condition $v_0$ satisfying 

$$\nu\,\partial_x^2 v_0(x) + \eta\,v_0(x) - v_0^3(x) + \varepsilon^{-\gamma} g(x) = 0, \qquad (8.15)$$

for some exponent $\gamma > 0$ to be determined. Such a $v_0$ always exists since the coercive 
"energy functional" 

$$E(v) = \int_{S^1}\Bigl(\frac{\nu}{2}|\partial_x v(x)|^2 - \frac{\eta}{2}|v(x)|^2 + \frac14|v(x)|^4 - \varepsilon^{-\gamma} g(x)v(x)\Bigr)\,dx$$

has at least one critical point. Even though $v_0$ is in general very large (see however 
Lemma 8.21 below), it follows from (8.6) that the target $v$ constructed in this way is 
bounded independently of $\varepsilon$. 

The remaining ingredients of the proof are Lemmas 8.22 and 8.21 below. To show 
that this is sufficient, note first that (8.6) implies the existence of a constant $C$ such that 
$\|u(1)\|_{L^2} \le C$ independently of $u_0$. It then follows from Lemmas 8.22 and 8.21 that 
(choosing for example $\beta = \gamma/14$) there exists a constant $C$ such that one has the bound 

$$\|u(2) - v_0\|_{L^2} \le C\varepsilon^{\gamma/6}.$$

Since the uncontrolled equation expands at rate at most $\eta$, this immediately yields 
$\|u(T) - v\|_{L^2} \le Ce^{\eta}\varepsilon^{\gamma/6}$. On the other hand, we know from Proposition 8.7 that there 
exists a constant $C$ such that $\|u(T) - v\|_{H^2} \le C$, so that 

$$\|u(T) - v\|_{H^1} \le \bigl(\|u(T) - v\|_{L^2}\|u(T) - v\|_{H^2}\bigr)^{1/2} \le C\varepsilon^{\gamma/12},$$
and the claim follows by choosing $\gamma > 12$. □ 
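
The interpolation step $\|w\|_{H^1} \le (\|w\|_{L^2}\|w\|_{H^2})^{1/2}$ used above is a consequence of the Cauchy-Schwarz inequality on the Fourier side. The Python sketch below is an illustrative check only; it assumes the convention $\|w\|^2_{H^s} = \sum_k (1+k^2)^s|\hat w_k|^2$ on the circle (the text does not fix a normalisation), and the helper names `sobolev_norm` and `interpolation_gap` are hypothetical.

```python
import math
import random

def sobolev_norm(coeffs, s):
    """H^s norm of a function on the circle from its Fourier coefficients.

    coeffs maps the integer frequency k to the (complex) coefficient w_k.
    Assumed convention: ||w||_{H^s}^2 = sum_k (1 + k^2)^s |w_k|^2.
    """
    return math.sqrt(sum((1 + k * k) ** s * abs(c) ** 2
                         for k, c in coeffs.items()))

def interpolation_gap(coeffs):
    """Return (||w||_{L^2} ||w||_{H^2})^(1/2) - ||w||_{H^1}, which is >= 0."""
    l2 = sobolev_norm(coeffs, 0)
    h1 = sobolev_norm(coeffs, 1)
    h2 = sobolev_norm(coeffs, 2)
    return math.sqrt(l2 * h2) - h1

# Random trigonometric polynomial with decaying coefficients.
random.seed(0)
sample = {k: complex(random.gauss(0, 1), random.gauss(0, 1)) / (1 + k * k)
          for k in range(-20, 21)}
gap = interpolation_gap(sample)
```

For a single Fourier mode the gap vanishes, so the inequality is sharp; this is why an $L^2$ bound with a small power of $\varepsilon$ and a uniform $H^2$ bound yield an $H^1$ bound with half that power.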

Lemma 8.21 There exists a constant $C_v$ independent of $\varepsilon \le 1$ such that the bound 

$\|v_0\|_{L^\infty} \le C_v\,\varepsilon^{-\gamma/3}$ holds. 

Proof. It follows immediately from (8.15), using the fact that $\partial_x^2 v_0 \le 0$ at the 
maximum and $\partial_x^2 v_0 \ge 0$ at the minimum. □ 
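
The maximum-principle argument can be made quantitative. At a maximum point of $v_0$, (8.15) together with $\partial_x^2 v_0 \le 0$ gives $v_0^3 \le \eta v_0 + \varepsilon^{-\gamma}\|g\|_{L^\infty}$, so $\max v_0$ is at most the positive root $z$ of $z^3 - \eta z - \varepsilon^{-\gamma}\|g\|_{L^\infty} = 0$ (the minimum is handled symmetrically). The Python sketch below is illustrative only: the function names, the explicit constant $\max\{(2\|g\|_{L^\infty})^{1/3}, \sqrt{2\eta}\}$ and the sample parameter values are assumptions made for the example, not taken from the text.

```python
def positive_root(eta, rhs):
    """Unique positive root z of z**3 - eta*z - rhs = 0 (eta > 0, rhs >= 0)."""
    def p(z):
        return z**3 - eta * z - rhs

    lo = (eta / 3.0) ** 0.5        # p(lo) < 0 and p is increasing on [lo, inf)
    hi = max(1.0, 2.0 * lo)
    while p(hi) < 0.0:             # bracket the root
        hi *= 2.0
    for _ in range(200):           # bisection
        mid = 0.5 * (lo + hi)
        if p(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return hi

def sup_bound_constant(eta, g_sup):
    """One admissible constant C_v: either eta*z <= rhs, so z**3 <= 2*rhs,
    or eta*z > rhs, so z**2 < 2*eta; both cases are covered for eps <= 1."""
    return max((2.0 * g_sup) ** (1.0 / 3.0), (2.0 * eta) ** 0.5)

# Hypothetical sample values: eta = 1, sup|g| = 1, gamma = 3.
eta, g_sup, gamma = 1.0, 1.0, 3.0
C_v = sup_bound_constant(eta, g_sup)
bounds = {eps: positive_root(eta, eps ** (-gamma) * g_sup)
          for eps in (1.0, 0.5, 0.1, 0.01)}
```

Each value in `bounds` can then be compared with `C_v * eps ** (-gamma / 3)` to see the scaling claimed in the lemma.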

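Lemma 8.22 For every exponent $\beta \in [0, \gamma/4]$ there exists a constant $C$ such that the 
bound 

$$\int_{S^1}(u - v_0)(u^3 - v_0^3)\,dx \ge C\varepsilon^{-2\beta}\int_{S^1}(u - v_0)^2\,dx - C\varepsilon^{(\gamma - 13\beta)/3}$$

holds for every $\varepsilon \le 1$ and every $u \in L^2(S^1)$. 

Proof. The proof is based on the fact that since $g$ has only isolated zeroes, the function 
$v_0$ necessarily has the property that it is large at most points. More precisely, consider 
some exponent $\beta \in [0, \gamma/3]$ and define the set $A = \{x \in S^1 : |v_0(x)| \ge \varepsilon^{-\beta}\}$. 
We claim that there then exists a constant $C$ such that the Lebesgue measure of the 
complement $A^c$ is bounded by $|A^c| \le C\varepsilon^{\alpha}$ for $\alpha = \min\{\gamma - 3\beta, \frac{\gamma - \beta}{3}\}$. Indeed, consider the set $\tilde A$ of 
points such that $|g(x)| \le 9\varepsilon^{\alpha}$. Since $g$ is assumed to be smooth and have simple zeroes, 
$|\tilde A| \le C\varepsilon^{\alpha}$ and the complement of $\tilde A$ consists of finitely many intervals on which $g$ 
has a definite sign. 

Consider one such interval $I$ on which $g(x) > 9\varepsilon^{\alpha}$, so that the definition of $v_0$ 
yields the estimate $v_0'' \le -9\varepsilon^{\alpha - \gamma} - v_0 + v_0^3$. It follows that, for every $x \in I$, one either 
has $v_0''(x) \le -\varepsilon^{\alpha - \gamma}$, or one has $v_0(x) \ge 2\varepsilon^{(\alpha - \gamma)/3} \ge 2\varepsilon^{-\beta}$ (since we set $\alpha \le \gamma - 3\beta$). 
We conclude that $I \cap A^c$ consists of at most two intervals and that $|v_0''(x)| \ge \varepsilon^{\alpha - \gamma}$ for 
every $x \in I \cap A^c$, so that $|I \cap A^c| \le C\varepsilon^{(\gamma - \alpha - \beta)/2}$ and the bound follows. (The same 
reasoning but with opposite signs applies to those intervals on which $g(x) < -9\varepsilon^{\alpha}$.) 
This yields the sequence of bounds 

$$\begin{aligned}
2\int_{S^1}(u - v_0)(u^3 - v_0^3)\,dx &\ge \int_{S^1}(u - v_0)^2(u^2 + v_0^2)\,dx \\
&\ge \varepsilon^{-2\beta}\int_{A}(u - v_0)^2\,dx + \int_{A^c}(u - v_0)^2(u^2 + v_0^2)\,dx \\
&\ge \varepsilon^{-2\beta}\int_{A}(u - v_0)^2\,dx + \frac14\int_{A^c}(u - v_0)^4\,dx \\
&\ge \varepsilon^{-2\beta}\int_{A}(u - v_0)^2\,dx + \frac{\varepsilon^{-\alpha}}{C}\Bigl(\int_{A^c}(u - v_0)^2\,dx\Bigr)^2 \\
&\ge \varepsilon^{-2\beta}\int_{A}(u - v_0)^2\,dx + \frac{\varepsilon^{-\alpha}}{C}\Bigl(\varepsilon^{\alpha - 2\beta}\int_{A^c}(u - v_0)^2\,dx - \varepsilon^{2\alpha - 4\beta}\Bigr) \\
&\ge C\varepsilon^{-2\beta}\int_{S^1}(u - v_0)^2\,dx - C\varepsilon^{\alpha - 4\beta},
\end{aligned}$$

which is the required estimate, since for $\beta \le \gamma/4$ one has $\alpha = (\gamma - \beta)/3$ and therefore $\alpha - 4\beta = (\gamma - 13\beta)/3$. □ 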